SRE Manager

VirtualVocations
Santa Clara, California, United States
Full-time
We are sorry. The job offer you are looking for is no longer available.

A company is looking for a Site Reliability Engineering Manager.

Key Responsibilities:
- Lead and mentor a team of SREs, fostering a culture of continuous improvement and innovation
- Collaborate with product and engineering teams to design and implement scalable solutions
- Develop and maintain a reliable monitoring and alerting system to detect and mitigate issues proactively

Required Qualifications:
- Minimum of 8 years of experience in SRE, DevOps, or similar roles, with at least 2 years in a leadership position with direct reports
- Experience leading geographically dispersed teams
- Proficiency in programming languages such as Python, Go, or Java
- Extensive experience with cloud services (AWS, GCP, Azure) and container orchestration tools (Kubernetes, Docker)
- Bachelor's degree in Computer Science, Engineering, or a related field; Master's preferred

16 days ago
Related jobs
Promoted
Apple
Cupertino, California

As a Site Reliability Engineering Manager, responsibilities include: - Lead SRE teams responsible for reliability and performance of on-prem and cloud-based services - Leading and growing the engineers on your team - Manage staging and production environments with goal of maximizing availability - P...

Promoted
Apple
Cupertino, California

We are looking for a passionate and talented Site Reliability Engineering Manager to continue our focus on providing our customers the highest quality Apple Services experience. If you love designing, engineering and running systems and infrastructure that will help millions of customers, then this is...

TikTok
San Jose, California

MLOps - Global SRE team is responsible for the stability of machine learning systems under the Global Monetization Products and Technology organization, to ensure the stable and efficient operations of machine learning models from data preparation, development, training, deployment, serving and so o...

Promoted
Apple
Cupertino, California

As a Site Reliability Engineering Manager, responsibilities include: - Lead SRE teams responsible for reliability and performance of on-prem and cloud-based services - Run staging and production environments with the goal of maximizing uptime - Promote observability of systems for monitoring, alerting,...

NVIDIA
Santa Clara, California

As a Sr Manager in Site Reliability Engineering (SRE), you will lead a team dedicated to the design, construction, and maintenance of expansive production systems, emphasizing high efficiency and availability. SRE Senior Managers bring specialized expertise in areas such as systems, networking, stor...

ByteDance
San Jose, California

Our Site Reliability Engineering (SRE) team combines software and systems to create and run large, distributed systems reliably. As a Technical Lead Manager, you'll lead a group of software/system engineers. Minimum of seven years of work experience in software development or SRE, particularly in th...

Intuit
Mountain View, California

Intuit is the global financial technology platform that powers prosperity for the people and communities we serve. With approximately 100 million customers worldwide using products such as TurboTax, Credit Karma, QuickBooks, and Mailchimp, we believe that everyone should have the opportunity to prosp...

ByteDance
San Jose, California

MLOps - Global SRE team is responsible for the stability of machine learning systems under the Global Monetization Products and Technology organization, to ensure the stable and efficient operations of machine learning models from data preparation, development, training, deployment, serving and so o...