Principal Site Reliability Engineer

VirtualVocations
Seattle, Washington, United States
Full-time

A company is looking for a Principal Site Reliability Engineer to ensure the uptime, performance, and scalability of its critical infrastructure.

Key Responsibilities:
- Develop and implement automation solutions to streamline operations
- Design and implement effective monitoring and alerting systems
- Own the incident lifecycle, leading root cause analysis and resolution

Required Qualifications:
- Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience)
- Minimum of ten (10) years of experience in a Site Reliability Engineering or related role
- Strong understanding of system administration, Linux, and scripting languages (Python and various shells)
- Expertise in container orchestration (Kubernetes and Docker)
- Experience with cloud platforms and infrastructure management tools (AWS, Ansible, Terraform, etc.)

3 days ago
Related jobs
Promoted
VirtualVocations
Seattle, Washington

A company is looking for a Site Reliability Engineer responsible for designing, building, and maintaining infrastructure for highly available solutions. ...

Promoted
Apple
Seattle, Washington

We are looking for a passionate and talented Site Reliability Engineer to continue our focus on providing our customers the highest quality Apple Services experience. We are seeking a highly skilled and motivated Security Site Reliability Engineer (SRE) to join our dynamic and growing team. Understand...

Promoted
Microsoft
Redmond, Washington

Site Reliability Engineering IC3 - The typical base pay range for this role across the U. OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration. This is a fantasti...

Promoted
Apple
Seattle, Washington

We are looking for passionate and talented Site Reliability Engineers to continue our focus on providing our customers the highest quality Apple Services experience. The Apple Service Engineering (ASE) team builds and provides systems and infrastructure that fuel Apple's services (such as iCloud, iTu...

Promoted
Next Level Business Services, Inc.
Redmond, Washington

Site Reliability Engineer - SMTP Service Management (Full Time). BS degree in Computer Science, Computer Engineering, Electrical Engineering, Management Information Systems, or other technical f...

Oracle
Seattle, Washington

We’re looking for Site Reliability Engineers (SREs) to help build highly distributed systems, platform services, and tools for a highly distributed multi-tenant cloud environment at massive scale. When not working on operations, the SRE is working on software engineering tasks such as design and deve...

Federal Reserve System
Seattle, Washington

As a Site Reliability Engineer, you will be part of the Data & Analytics Services (DAS) Team and will get an opportunity to broadly apply your engineering skills across various technology solutions, as well as build your skills in other areas by being exposed to various aspects of product delivery from i...

Oracle
Seattle, Washington

As a Site Reliability Engineer, you will solve interesting technical challenges by defining, designing, deploying, and solving key Oracle Cloud services, platforms, and infrastructure, always thinking about reliability, scalability, resilience, security, and performance. We are unencumbered and will...

Mondrian Alpha
Seattle, Washington

An industry-leading systematic trading fund is seeking highly skilled Site Reliability Engineers to join a team responsible for engineering and supporting the company's critical infrastructure platforms. This team also handles the centralized development infrastructure and works alongside engineerin...

ByteDance
Seattle, Washington

Scale systems sustainably through mechanisms such as automation, and evolve systems' reliability, efficiency, and velocity by pushing for changes. Participate in technical operations and rotations in response to performance and reliability issues. ...