Talent.com
Lead Site Reliability Engineer - Cloud Operations. - US Citizen Required
Lead Site Reliability Engineer - Cloud Operations. - US Citizen RequiredOracle • Salt Lake City, UT, US
serp_jobs.error_messages.no_longer_accepting
Lead Site Reliability Engineer - Cloud Operations. - US Citizen Required

Lead Site Reliability Engineer - Cloud Operations. - US Citizen Required

Oracle • Salt Lake City, UT, US
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Job Description

Oracle Health (OHAI) is a leader in generative AI for healthcare, focusing on cutting-edge cloud services that streamline healthcare operations. Our EHR and Clinical AI Agent platforms help healthcare providers reduce manual tasks and improve patient care. We are expanding our OCI Cloud Operations team and are seeking a Principal Site Reliability Engineer (SRE) to ensure the reliability, performance, and scalability of these services in a production environment.

Role Overview

As a key member of Oracle Health's Cloud Operations team, you will be responsible for the operational health, reliability, and performance of our EHR and Clinical AI Agent services. You will ensure these customer-facing platforms meet the highest standards of scalability, availability, and security, while developing and executing strategies for continuous optimization. Your focus will include cloud service performance monitoring, incident management, and operational efficiency across Kubernetes-based environments. In this role, you will serve as a technical team lead, providing mentorship to Site Reliability Engineers (SREs)

Key Responsibilities

Service Ownership & Leadership :

Own operational aspects of the EHR and Clinical AI Agent cloud services, ensuring their reliability and performance.

Serve as a technical lead within the Site Reliability Engineering (SRE) team, promoting adherence to best practices in incident management, service design, operational excellence, and automation.

Mentor and support team members, fostering professional growth, technical proficiency, and a culture of accountability and continuous improvement in service operations

Operations Engineering

Oversee deployment and operations of cloud services across commercial and government data centers, ensuring compliance with corporate and regulatory standards.

Monitor and optimize resource utilization, performance, and scalability to maintain high availability and reliability.

Ensure security and compliance of all services in alignment with organizational and governmental requirements.

Manage incidents proactively, identifying and resolving issues in real time to minimize downtime and maintain system stability.

Lead critical incident response efforts, collaborating with development teams to implement corrective and preventive measures.

Service Design & Optimization :

Design and implement zero-downtime deployment strategies for software and security updates.

Work with cross-functional teams to enhance service stability and ensure predictable, efficient operation.

Implement proactive measures for system failure analysis and rapid issue resolution.

Automation & Continuous Improvement :

Lead automation initiatives to streamline operational tasks and reduce manual intervention.

Drive improvements to monitoring and alerting frameworks using tools like Prometheus and Grafana.

Implement Infrastructure as Code (IaaC) practices using Terraform and Shepherd.

Assist in the optimization of cloud-based services, ensuring smooth performance, scalability, and operational efficiency.

Qualifications

Experience : 8+ years in Site Reliability Engineering, DevOps, or Cloud Operations.

Experienced in managing customer-facing, Kubernetes-based Cloud services.

Cloud & Container Technologies : Experience with OCI, Kubernetes, Docker, Prometheus, Grafana, and cloud-native solutions.

Scripting & Automation : Proficiency in Python, Perl, Shell Scripting, and tools like Terraform.

Incident Management : Strong troubleshooting skills for resolving complex issues in production systems.

Cloud Platforms : Experience with OCI, AWS, GCP, or Azure.

Version Control : Familiarity with Git.

Operating Systems : Extensive experience with Linux / Unix environments in a Cloud Production environment.

Security & Compliance : Knowledge of cloud security best practices, particularly in regulated industries like healthcare.

US Citizenship on US soil is required. This position requires you to be eligible to receive a federal security clearance which requires you to be a US Citizen.

Responsibilities

Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and / or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations. Understand and explain the affect of product architecture decisions on distributed systems. Professional curiosity and a desire to a develop deep understanding of services and technologies.

Disclaimer

Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.

Range and benefit information provided in this posting are specific to the stated locations only

US : Hiring Range in USD from : $86,400 to $199,500 per annum. May be eligible for bonus and equity.

Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.

Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.

Oracle US offers a comprehensive benefits package which includes the following

Medical, dental, and vision insurance, including expert medical opinion

Short term disability and long term disability

Life insurance and AD&D

Supplemental life insurance (Employee / Spouse / Child)

Health care and dependent care Flexible Spending Accounts

Pre-tax commuter and parking benefits

401(k) Savings and Investment Plan with company match

Paid time off : Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.

11 paid holidays

Paid sick leave : 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.

Paid parental leave

Adoption assistance

Employee Stock Purchase Plan

Financial planning and group legal

Voluntary benefits including auto, homeowner and pet insurance

The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.

Career Level - IC4

About Us

As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.

We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.

Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.

We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling +1 888 404 2494 in the United States.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

serp_jobs.job_alerts.create_a_job

Site Reliability Engineer • Salt Lake City, UT, US

Job_description.internal_linking.related_jobs
Site Reliability Engineer Lead

Site Reliability Engineer Lead

VirtualVocations • Salt Lake City, Utah, United States
serp_jobs.job_card.full_time
A company is looking for a Site Reliability Engineer, Team Lead.Key Responsibilities Ensure 24x7 availability of production application systems and drive operational efficiency initiatives Ident...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Cloud Engineer

Cloud Engineer

Unisys Corporation • Salt Lake City, UT, United States
serp_jobs.job_card.full_time
What success looks like in this role : .Design and implement cloud computing solutions using AWS, Azure, or Google Cloud Platform. Manage and optimize cloud infrastructure, ensuring high availability ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Senior Database Reliability Engineer

Senior Database Reliability Engineer

VirtualVocations • Salt Lake City, Utah, United States
serp_jobs.job_card.full_time
A company is looking for a Senior Database Reliability Engineer to ensure the performance, scalability, and reliability of its databases and supporting applications. Key Responsibilities Optimize ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
Principal Site Reliability Engineer

Principal Site Reliability Engineer

VirtualVocations • Salt Lake City, Utah, United States
serp_jobs.job_card.full_time
A company is looking for a Consulting / Principal Site Reliability Engineer.Key Responsibilities Lead Kubernetes deployment and management, including orchestration, architecture, networking, CI / CD,...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Cloud Engineer

Cloud Engineer

VirtualVocations • Provo, Utah, United States
serp_jobs.job_card.full_time
A company is looking for a Cloud Engineer Specialist with expertise in cloud computing and Kubernetes.Key Responsibilities Conduct discussions on architectural solutions to ensure scalability and...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Senior DevSecOps Engineer

Senior DevSecOps Engineer

VirtualVocations • Provo, Utah, United States
serp_jobs.job_card.full_time
A company is looking for a Senior DevSecOps Engineer to support the Defense Logistics Agency's secure API Gateway Program. Key Responsibilities Design, develop, and manage Ansible playbooks, roles...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Site Reliability Engineer Team Lead

Site Reliability Engineer Team Lead

VirtualVocations • Salt Lake City, Utah, United States
serp_jobs.job_card.full_time
A company is looking for a Site Reliability Engineer, Team Lead.Key Responsibilities Ensure 24x7 availability of production application systems Drive initiatives to improve operational efficienc...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Cyber Reliability Engineer

Cyber Reliability Engineer

VirtualVocations • Salt Lake City, Utah, United States
serp_jobs.job_card.full_time
A company is looking for a Cyber Reliability Engineer Senior Consultant specializing in Infrastructure Monitoring.Key Responsibilities Collaborate with cross-functional teams to ensure monitoring...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Site Reliability Engineering Manager

Site Reliability Engineering Manager

VirtualVocations • Salt Lake City, Utah, United States
serp_jobs.job_card.full_time
A company is looking for a Site Reliability Engineering Manager (Remote).Key Responsibilities Collaborating in a team-oriented environment and managing performance of direct reports Organizing t...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Lead Site Reliability Engineer

Lead Site Reliability Engineer

VirtualVocations • Provo, Utah, United States
serp_jobs.job_card.full_time
A company is looking for a Lead Site Reliability Engineer (SRE).Key Responsibilities Drive incident response best practices, lead postmortems, and define SLAs / SLOs across platform services Colla...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Site Reliability Engineer

Site Reliability Engineer

BankTalent HQ • Midvale, UT, United States
serp_jobs.job_card.full_time
Zions Bancorporation's Enterprise Technology and Operations (ETO) team is transforming what it means to work for a financial institution. With a commitment to technology and innovation, we have been...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Sr AWS Cloud DevOps Engineer

Sr AWS Cloud DevOps Engineer

Unisys Corporation • Salt Lake City, UT, United States
serp_jobs.job_card.full_time
What success looks like in this role : .DevSecOps Pipeline Design & Automation : .Design and implement secure, automated CI / CD pipelines in AWS using tools like AWS CodePipeline, Jenkins, GitLab CI, an...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
AWS Cloud Engineer

AWS Cloud Engineer

VirtualVocations • Provo, Utah, United States
serp_jobs.job_card.full_time
A company is looking for an AWS Cloud Engineer.Key Responsibilities Design and implement scalable, secure, and cost-effective cloud infrastructure solutions on AWS Develop and maintain Terraform...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Senior AWS DevOps Engineer

Senior AWS DevOps Engineer

VirtualVocations • Provo, Utah, United States
serp_jobs.job_card.full_time
A company is looking for a Senior AWS DevOps Engineer.Key Responsibilities Collaborate with development teams to design and optimize cloud solutions on AWS Automate infrastructure provisioning a...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Site Reliability Engineer

Site Reliability Engineer

VirtualVocations • Salt Lake City, Utah, United States
serp_jobs.job_card.full_time
A company is looking for a Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of their systems and applications. Key Responsibilities Build, maintain, and oper...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

Unisys Corporation • Salt Lake City, UT, United States
serp_jobs.job_card.full_time
What success looks like in this role : .Design, implement, and manage scalable and reliable systems.Monitor system performance and troubleshoot issues. Collaborate with development teams to improve th...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Customer Reliability Engineer

Customer Reliability Engineer

VirtualVocations • Provo, Utah, United States
serp_jobs.job_card.full_time
A company is looking for a Customer Reliability Engineer III.Key Responsibilities Manage and resolve customer technical issues via support tickets and real-time interactions Act as a liaison bet...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Senior Cloud Engineer

Senior Cloud Engineer

VirtualVocations • Provo, Utah, United States
serp_jobs.job_card.full_time
A company is looking for a Senior Cloud Engineer - DevOps.Key Responsibilities Architect and implement automated CI / CD pipelines for various application workloads Design, build, and manage scala...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Senior DevOps Engineer

Senior DevOps Engineer

VirtualVocations • Salt Lake City, Utah, United States
serp_jobs.job_card.full_time
A company is looking for a Senior DevOps Engineer.Key Responsibilities : Implement and support the production platform Innovate in the engineering platform space by incorporating new technologies...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
AWS DevOps Engineer

AWS DevOps Engineer

VirtualVocations • Provo, Utah, United States
serp_jobs.job_card.full_time
A company is looking for an AWS DevOps Engineer - Tech Lead.Key Responsibilities Collaborate with development teams to design and optimize cloud solutions on AWS Automate infrastructure provisio...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted