Staff Site Reliability Engineer - AWS EKS

VirtualVocations
Santa Clara, California, United States
Full-time
We are sorry. The job offer you are looking for is no longer available.

A company is looking for a Staff Site Reliability Engineer - AWS / EKSKey Responsibilities : Support stability, reliability, and scalability of distributed systemsIdentify areas for improvement and perform technical reviewsDesign monitoring systems and implement automation strategiesQualifications : 7+ years of Site Reliability Engineering experience5+ years of orchestration systems experience (e.

g., Kubernetes)Experience with scripting languages and Infrastructure as CodeKnowledge of AWS and familiarity with other cloud platformsExperience with CI / CD tools and deployment strategies

9 days ago
Related jobs
Promoted
Silver Valley Metals Corporation, site: Bunker Hill Mine
Palo Alto, California

Site Reliability Engineer, Production Engineer, Platform Engineer). Collaborate, partner, advise, review and mentor engineering teams on Reliability topics like high reliability architecture, observability, safe change management. As an engineer in the Infrastructure department at Alchemy, you will ...

Zscaler
San Jose, California

Ensure new services and iterations of existing services are built for reliability, scalability and ease of operations. Champion SRE principles and practices within Engineering Department. Experience in being able to effectively lead a team of cross-functional engineers. Bachelor's degree in Computer...

NetApp
Santa Clara, California

Title: Site Reliability Engineer. Cloud, Linux, Software Engineer, Developer, Java, Technology, Engineering. As a Seasoned Software Engineer, you will be involved in both the SRE operations as well as monitoring using Dynatrace/Instana. The resource should be involved in SRE operations like OS patch...

Rivian
Palo Alto, California

As a Supplier Reliability Engineer, you will play a key role in designing Rivian's products with reliability in mind. Influence supplier selection for higher reliability and provide clear guidance on reliability requirements and demonstration to suppliers. Reliability is at the core of Electric Adve...

Apple
Cupertino, California

We are looking for seasoned software and systems engineers to join the Block Storage SRE team at Apple. This engineer’s work will affect hundreds of millions of users and be essential to the success of some of the most visible current and future Apple features. We think critically and strive to bala...

Splunk Inc
California, United States

Learn more aboutSplunkcareers and how you can become a part of our journey!Role:Splunk is looking for a TechOps Engineer with the ability to provide day-to-day technical expertise for our Splunk Cloud Azure TechOps team and the Splunk organization. As a TechOps Engineer, you will be interfacing with...

ByteDance
San Jose, California

Our data infrastructure Site Reliability Engineering (SRE) team is a pioneer in innovation. Establish sustainable mechanisms for scaling systems, such as automation, to drive enhancements in reliability, efficiency, and velocity. ...

TikTok
Mountain View, California

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As a site reliability engineer in the Ads data platform area, you will have the opportunity to manage the services and infrastructures in one...

Bayside Solutions
CA, United States

Along with CloudStack/OpenStack, Virtualization and Linux, really needing the below experience as well.Kickstart and Bootstrap, as well as deployment to 100k servers across different data centers simultaneously.Additionally, experience with load balancers, high availability (HA), and failover proces...

ByteDance
San Jose, California

Therefore, we set up an engineer team with high talent density, mainly focusing on AI technology and Privacy&Security in CapCut. ...