Engineer I, Site Reliability

Oak Street Health
Chicago, Illinois
Full-time

Role Description

As a Site Reliability Engineer, you will be instrumental to the stability and performance of a new kind of platform for healthcare, one built specifically for the clinical team.

From design to implementation, you will partner with our stellar software engineering teams in a fast-paced, agile environment to turn ideas into reality.

Utilizing modern methodologies and open source tools, you will be empowered to set the engineering excellence standards as we seek to deliver applications that will directly and immediately impact the experience of our teams and our patients.

Core Responsibilities

Review systems to identify and implement the necessary telemetry, monitoring and alerting for proactive and reactive management.

Partner with Product and Application Development (AD) to define and review Service Level Objectives (SLOs) and Service Level Agreements (SLAs).

Participate in design reviews to ensure solutions can meet SLOs and SLAs.

Design and automate performance and resiliency test cases in partnership with application development and infra teams.

Identify manual, repeatable tasks and eliminate them through automation or application enhancements, in partnership with development.

Other duties, as assigned.

What are we looking for?

Bachelor's degree or relevant industry experience.

Minimum of 3 years of development experience in consumer-facing products leveraging cloud-native technologies.

Experience automating pipelines using continuous delivery tools.

Experience with system monitoring, alerting and observability platform tools and best practices.

Experience with capacity planning and management.

Experience with resilient systems, resiliency testing and design best practices.

Experience with nonfunctional requirements along with SLOs and SLAs.

Preferred: Experience with our tech stack: Istio, Grafana Labs, .NET Core, Confluent Kafka, Mongo, gRPC, AKS, Docker, Azure.

Preferred: Experience managing Kubernetes clusters in a production environment.

Preferred: Experience monitoring microservices-based applications at scale.

US Work Authorization

Someone who embodies being 'Oaky'
