A company is looking for a Manager of Site Reliability Engineering.
Key Responsibilities
Lead and support a team of SREs and DBREs through performance management and career development
Define and evolve the SRE roadmap, including reliability metrics, tooling, and incident response
Oversee the operation and scaling of GCP infrastructure and drive automation across service operations
Required Qualifications
6+ years in infrastructure, site reliability, or cloud engineering roles, with 2-3+ years leading SRE teams
Deep experience operating systems in GCP (compute, IAM, storage, networking, etc.)
Hands-on skills with infrastructure-as-code (Terraform), CI / CD pipelines, and Kubernetes
A mindset of mentorship and empathy for developers and members
Experience with MySQL, Redis, or API gateways is a plus
Manager Site Reliability • San Angelo, Texas, United States