Manager, Cloud Operations and Systems Engineering
Marathon Health is a leading provider of advanced primary care in the U.S., serving 2.5 million eligible patients through approximately 630 employer and union-sponsored clients.
The Cloud Operations and Systems Engineering Manager leads our cloud infrastructure and systems operations team within our Technology organization. This role is responsible for AWS cloud deployments and operations, and oversight of Microsoft 365 (M365) environments.
Essential Duties & Responsibilities
- Lead and mentor a high-performing team of cloud and systems engineers.
- Drive strategic planning and execution of cloud and systems initiatives.
- Foster a culture of continuous improvement and operational excellence.
Technical Execution
Oversee AWS cloud architecture, deployment, monitoring, and optimization.Oversee M365 services including Exchange Online, SharePoint, Teams, and Intune.Ensure system performance, availability, and scalability through proactive monitoring and remediation.Support production environments and ensure high availability and reliability of services.Operational Excellence
Lead and participate in infrastructure and platform efforts that span multiple teams, aligning technical solutions with organizational goals and engineering priorities.Own the IT operations support process, including ticket management and SLA achievement.Develop and refine operational processes, automation strategies, and incident response protocols.Lead implementation of observability and monitoring practices to ensure visibility into system health and performance.Collaboration & Cross-Functional Delivery
Partner closely with the VP of IT Operations, VP of Core Engineering, VP of Data and Analytics, and the Principal Architect to align infrastructure and systems strategies with business goals.Collaborate with Information Security and Compliance team to implement IAM policies, secrets management practices, and audit controls across environments.Ensure infrastructure and systems are designed and maintained in compliance with HIPAA and PCI standards.Support the deployment and operationalization of internally engineered solutions, including Marathon Health's patient portal and admin portal, and third-party solutions including Salesforce, Snowflake, Tableau.Team Development
Provide technical leadership, guidance, and mentoring to Site Reliability Engineers, System Engineers, and System Administrators.Conduct regular performance reviews, training, and career development planning.Promote knowledge sharing and best practices across the IT Operations team.Qualifications
Bachelor's degree in systems engineering, computer science, or a related field and a minimum of 5 years' experience deploying cloud-based applications or developing in cloud environments, or equivalent combination of education and experience. AWS certification preferred.
Preferred Technical Skills
Cloud & Infrastructure
Strong and current AWS expertise, particularly in ECS, Docker, Kubernetes, IAM, and Terraform.Experience with containerization, microservices, and CI / CD pipelines.Familiarity with virtualization technologies and observability tools.Enterprise Platforms
Experience with M365, Snowflake, Databricks, BOOMI, Tableau, and Salesforce.Experience with supporting multiple environments (Dev / QA / Staging / Prod) and aligning with engineering development teams and best practices.DESIRED ATTRIBUTES
Experience leading DevOps or Cloud Engineers, System Engineers, and System Administrators.Comfortable working in a fast-paced, production-support environment with flexible hours as needed.Strong communication and collaboration skills across distributed teams.Ability to work independently and provide knowledge transfer to internal teams.Effective time management, prioritization, and organizational skills.Exceptional attention to detail, strong work ethic, and excellent analytical and problem-solving abilities.Experience leading teams responsible for development and deployment of automated tools, systems, and services across multiple technology domains.Strong understanding of AWS account structure best practices, networking, and VPC configurations.Advanced knowledge of infrastructure components including networking, cloud services, orchestration tools, containerization, compute, and storage systems.Experience with version-controlled Infrastructure as Code (IaC) tools and practices.Understanding of Kubernetes and container orchestration technologies.Familiarity with industry compliance and security frameworks such as HIPAA, SOC 2, and HITRUST.Pay Range : $120,000 - $160,000 / yr
J-18808-Ljbffr