Engineering Manager, Infrastructure
As an Engineering Manager for the Infrastructure team, you'll lead the engineers responsible for keeping Apollo's systems fast, reliable, and scalable as we serve millions of daily users and process billions of data points. You'll work at the intersection of platform engineering, SRE, observability, and developer productivity, ensuring that our foundation can support Apollo's AI-native evolution and rapid growth.
You will :
- Build and lead a world-class infrastructure team focused on reliability, scalability, and performance.
- Report directly to the CTO, and partner closely with Product, Data, and AI Platform leaders to ensure the underlying systems enable fast, safe, and confident iteration.
- Drive best-in-class engineering practices for production uptime, performance, CI / CD, observability, incident management, and cost optimization.
- Foster a culture of excellence, ownership, and continuous improvement where engineers are empowered to innovate and ship fearlessly.
- Help define Apollo's next-generation infrastructure strategy from cloud architecture to developer experience and AI-driven automation.
Daily adventures and responsibilities include :
Leading, coaching, and growing a distributed team of high-impact Infrastructure Engineers. You are expected to spend roughly 30% of the time as an IC.Partnering with senior engineering leadership on strategic initiatives such as cloud migration, infrastructure scaling, platform reliability, and cost efficiency.Defining and implementing modern operational excellence practices, including SLOs, error budgets, incident reviews, and performance monitoring.Guiding technical decision-making across key areas like Kubernetes, GCP, observability, networking, CI / CD, and IaC (Terraform, Ansible).Collaborating with AI, Data, and Product Engineering teams to ensure infrastructure scalability for ML and AI-native workloads.Running effective 1 : 1s, career development conversations, and quarterly performance reviews.Supporting recruiting efforts to attract top engineering talent across time zones.Competencies include :
Thinking in systems and scaling reliability through automation, not headcount.Being equally comfortable talking about incident response policies as Terraform modules or service mesh architecture.Building psychological safety within teams and helping engineers grow through mentorship and clear accountability.Communicating crisply, documenting thoughtfully, and driving alignment across distributed, cross-functional teams.Embracing Apollo's values of Ownership, Integrity, Curiosity, Excellence, Teamwork, and Fun.Skills and relevant experience include :
5+ years of hands-on software or infrastructure engineering experience.2+ years of experience leading teams of senior and staff-level engineers in platform, SRE, or infrastructure domains.Proven ability to design and operate large-scale distributed systems in cloud environments (preferably GCP or AWS).Expertise with Kubernetes, Docker, Terraform, Ubuntu, and CI / CD pipelines.Familiarity with observability tools (Grafana, Prometheus, ELK, Datadog, NewRelic) and performance tuning.Strong grounding in networking, security, and reliability principles.Experience managing infrastructure costs, availability SLAs, and high-throughput systems at scale.Bonus : Experience with AI / ML infrastructure, data pipelines, MongoDB, Ruby on Rails, Ansible, or ElasticSearch.Why you'll love working at Apollo :
Autonomy & Impact : Lead a mission-critical domain powering the growth of a top SaaS company.A Learning Culture : Work alongside brilliant engineers, experiment with emerging AI-driven DevOps tools, and stay at the cutting edge of cloud-native practices.Remote-First Flexibility : Collaborate across time zones while maintaining balance and focus.A High-Performance Team : Ownership, urgency, and support we win and grow together.If you're ready to lead Infrastructure for one of the fastest-scaling SaaS companies on the planet, join us.