Lead Site Reliability Engineer

VirtualVocations
Santa Clara, California, United States
Full-time

A company is looking for a Lead Site Reliability Engineer.

Key Responsibilities:
Investigating operational surprises and supporting teams in post-incident activities
Conducting in-depth incident analysis and maximizing post-incident learning across the organization
Completing short-term reliability consultancy and enablement engagements, such as SLO reviews and facilitating pre-mortems

Required Qualifications:
Solid experience in logging, monitoring, and observability of a highly distributed system
Leading incident management and response efforts, including critical, complex, and high-severity incidents
Experience working in a tech or product company with comparable scale and complexity
Proficiency in one or more object-oriented programming languages or experience with infrastructure-as-code
Experience in technical leadership and leading delivery of technical initiatives in an operational capacity

2 days ago
Related jobs
VirtualVocations
San Jose, California

A company is looking for a Senior Site Reliability Engineer. ...

Apple Inc.
Cupertino, California

The Apple Service Engineering - Redis SRE team is looking for Site Reliability Engineers with experience in developing processes, tools, and automation for managing distributed systems in production environments. This role is for engineers who enjoy deep technical engineering that spans large cross-...

Nvidia Corporation
Santa Clara, California

Similar Jobs (5):
Senior Site Reliability Engineer - Storage. Locations: US, CA, Santa Clara. Time type: Full time. Posted 30+ days ago.
Senior Site Reliability Engineer - GeForce NOW. Locations: 2 locations. Time type: Full time. Posted 7 days ago.
Senior Site Reliability Engineer, Data Scie...

Palo Alto Networks
Santa Clara, California

We’re looking for great SREs, as well as software engineers interested in production engineering, to help us scale the largest enterprise security cloud infrastructure in the world. Our cloud infrastructure is home to a series of massive and complicated distributed systems and virtualization softwar...

CENTRL
Mountain View, California

In this leadership role, you will oversee the strategic direction, planning, and execution of our cloud and infrastructure operations to ensure the high availability, scalability, and performance of our IT systems. Balance feature development speed and reliability with well-defined service level obj...

Apple Inc.
Cupertino, California

We are looking for passionate and talented Site Reliability Engineers to continue our focus on providing our customers the highest quality Apple Services experience. Our team leads the reliability engineering for iCloud Identity core services. Lead data-driven roadmap, quarterly planning for a subse...

Sustainable Talent
CA, United States

Senior Site Reliability Engineer. As an SRE, you will troubleshoot and manage our client's on-premises infrastructure to support various software engineering teams company-wide. ...

General Motors
Palo Alto, California

Chaos engineering implementation experience is a big plus. You have a story to tell about how you led and influenced a cross-organization effort to improve uptime to at least 99. BS/MS in Computer Science/Engineering preferred. ...

ByteDance
San Jose, California

TEAM INTRODUCTION Our data infrastructure Site Reliability Engineering (SRE) team is a pioneer in innovation. Establish sustainable mechanisms for scaling systems, such as automation, to drive enhancements in reliability, efficiency, and velocity. ...

Illumio
Sunnyvale, California

Our Engineering team has established a culture based on thought leadership, independence, and responsibility. This role will be onsite in Sunnyvale, CA HQ five days a week. As an SRE/DevOps Engineer, you will be responsible for designing, implementing, and managing our cloud infrastructure on Azure,...