Engineer II, Site Reliability

Oak Street Health
Chicago, Illinois
Full-time

Role Description

As a Site Reliability Engineer, you will be instrumental to the stability and performance of a new kind of platform for healthcare, one built specifically for the clinical team.

From design to implementation, you will partner with our stellar software engineering teams in a fast-paced, agile environment to transform ideas into reality.

Using modern methodologies and open-source tools, you will be empowered to set engineering excellence standards as we deliver applications that directly and immediately impact the experience of our teams and our patients.

Core Responsibilities

Review systems to identify and implement the necessary telemetry, monitoring and alerting for proactive and reactive management.

Partner with Product and Application Development (AD) to define and review Service Level Objectives (SLOs) and Service Level Agreements (SLAs).

Participate in design reviews to ensure solutions can meet SLOs / SLAs.

Design and automate performance and resiliency test cases in partnership with application development and infrastructure teams.

Partner with development to identify and eliminate manual, repeatable tasks through automation or application enhancements.

Other duties, as assigned.

What are we looking for?

Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience

Minimum of 3 years of development experience in consumer-facing products leveraging cloud-native technologies

Experience automating pipelines using continuous delivery tools.

Experience with system monitoring, alerting and observability platform tools and best practices.

Experience with capacity planning and management.

Experience with resilient systems, resiliency testing and design best practices.

Experience with nonfunctional requirements along with SLOs / SLAs.

Preferred: Experience with our tech stack: Istio, Grafana Labs, .NET Core, Confluent Kafka, Mongo, gRPC, AKS, Docker, Azure

Preferred: Experience managing Kubernetes clusters in a production environment

Preferred: Experience monitoring microservices-based applications at scale

US Work Authorization

Someone who embodies being 'Oaky'

30+ days ago