GEICO is seeking an experienced and visionary SRE Senior Manager to join the organization and aid the establishment and growth of the Site Reliability Engineering (SRE) practice for Hybrid Cloud - Infrastructure as a Service (IaaS).
As an SRE Leader, you will be responsible for leading and driving data center and network engineering initiatives to enhance the reliability, availability, performance, and security of Geico’s private and public cloud infrastructure.
You will collaborate closely with cross-functional teams, including Data Center and Network Engineering, Security, and PaaS / Application Software Development, to ensure robust data center and network architecture and software solutions and seamless operations.
This role combines technical expertise, leadership, and strategic thinking to drive continuous improvement in reliability and scalability.
Key Responsibilities :
Technical Leadership :
Provide strategic direction and technical leadership in the design, development, and deployment of a robust, reliable, and scalable digital infrastructure.
Drive the architecture, design, and optimization of highly available, scalable, and fault-tolerant systems and services supporting the digital engineering team.
Team Management :
Build and nurture a high-performing SRE team, providing mentorship, coaching, and guidance to foster a culture of continuous learning and improvement.
Work closely with all GEICO Tech products and platforms to manage, innovate and create new programs, software and analytics that improve the availability, scalability, latency and effectiveness of GEICO products and services.
Collaborate with cross-functional leaders including product area leads to guide product engineering to build reliable and durable production systems and contribute to the strategic direction of the Tech organization.
Collaboration and Communication :
Present a reliability vision and strategic recommendations with clarity and concision to stakeholders having varying degrees of SRE fluency.
Develop and own relationships with technology and business partners.
Foster effective collaboration and communication across cross-functional teams to align priorities, share best practices, and ensure smooth coordination for incident response, system maintenance, and upgrades.
Manage department budgets, resource allocation, and vendor relationships to optimize costs and maintain high-quality outcomes.
Qualifications :
Bachelor's degree in Computer Science, Information Technology, or a related field (Master's degree preferred).
Proven experience in a leadership role focused on software defined and software driven data center and network engineering within a complex, large-scale production environments.
Deep knowledge of SRE practices, methodologies, and principles, along with a solid understanding of on prem and public cloud based network, compute and storage technologies.
In-depth knowledge of hybrid cloud architecture, IaaS technologies, container orchestration platforms (e.g., Kubernetes), cloud efficiency and observability etc.
Strong background in incident management, performance tuning, and capacity planning. including creating incident response playbooks, incident triaging strategies, and post-incident analysis to drive continuous improvement in system reliability and availability.
Experience with open source management and monitoring tools (e.g. Cacti, Zabbix, Splunk, Prometheus, Grafana)
Experience with infrastructure automation, tooling, and configuration management frameworks (e.g., Puppet, Chef, Ansible, Terraform, etc.).
Familiarity with cloud security best practices and compliance standards.
Excellent leadership and team management skills with a passion for mentoring and fostering professional growth.
Strong problem-solving and analytical abilities, with a keen eye for detail and a passion for driving operational efficiency.
Experience in budget management, resource allocation, and vendor collaboration.
Certifications such as AWS Certified DevOps Engineer, Google Professional DevOps Engineer, or relevant cloud provider certifications are a plus.
LI-RP2
DICE
Annual Salary
$115,000.00 - $261,500.00
The above annual salary range is a general guideline. Multiple factors are taken into consideration to arrive at the final hourly rate / annual salary to be offered to the selected candidate.
Factors include, but are not limited to, the scope and responsibilities of the role, the selected candidate’s work experience, education and training, the work location as well as market and business considerations.
At this time, GEICO will not sponsor a new applicant for employment authorization for this position.
Benefits :
As an Associate, you’ll enjoy our
- to help secure your financial future and preserve your health and well-being, including :
- Premier Medical, Dental and Vision Insurance with no waiting period
- Paid Vacation, Sick and Parental Leave
- 401(k) Plan
- Tuition Reimbursement
- Paid Training and Licensures
- Benefits may be different by location. Benefit eligibility requirements vary and may include length of service.
Coverage begins on the date of hire. Must enroll in New Hire Benefits within 30 days of the date of hire for coverage to take effect.