Principal, Site Reliability Engineer - Kubernetes (R50024261)

Fox Corporation
Los Angeles, California
$161.5K-$212K a year
Full-time

OVERVIEW OF THE COMPANY

Under the FOX banner, we produce and distribute content through some of the world’s leading and most valued brands, including FOX News Media, FOX Sports, FOX Entertainment, FOX Television Stations and Tubi Media Group.

We empower a diverse range of creators to imagine and develop culturally significant content, while building an organization that thrives on creative ideas, operational expertise and strategic thinking.

JOB DESCRIPTION

At Fox Tech, we stand as a beacon of innovation, crafting world-class, large scale digital products that redefine the entertainment experience.

We're on the lookout for visionary individuals to join our pioneering team, tasked with shaping the future of streaming products.

Now is your chance to be part of creating and delivering extraordinary digital experiences spanning Sports and Entertainment.

As a key member of our team, you'll drive innovation and significantly contribute to our mission of pioneering the next generation of streaming products.

Your opportunity to create unparalleled fan experiences for iconic sports events is here. Our current advanced digital solutions, accessed by millions across web, mobile, and living room devices, signify just the start of our ambitious journey.

ABOUT THE ROLE

Fox is hiring a Principal Site Reliability Engineer - Kubernetes to build and operate the infrastructure and platforms that support our live direct-to-consumer APIs for major live events such as the Super Bowl, World Cup, and World Series.

The principal engineer will be the technical lead for solving thundering herd problems, including partnering with the application team to load test, scale up, and scale back down again, and helping design the platform and infrastructure to meet their needs.

A collaborative, peacemaking mindset is a must, as is fostering a culture of learning and continuous improvement for the entire team.

The principal engineer will additionally work with the Director, Platform Engineering to visualize workflows, and refine processes and policies to keep the team throughput high.

A SNAPSHOT OF YOUR RESPONSIBILITIES

Serve as technical lead for the implementation and operation of cloud-based infrastructure and platforms, including EKS and other AWS services, supporting direct-to-consumer APIs and solving associated thundering herd problems, including load testing, scaling up, and scaling back down again

Work closely with Video & Player Engineering and 3rd party teams to help design and implement scalability, cost visibility and observability in the platform

Help to mentor and train less senior members of the team

Assist with product / technology selection, including evaluating maturity and support, and the design and implementation of POCs

Work with the Director, Site Reliability Engineering to foster a culture of learning and continuous improvement, and help to conceptualize and visualize workflows and processes

Perform post-incident analysis to identify root causes and potential workarounds / solutions

Be fluid and open to change and evolving processes and tools

Other duties as assigned

WHAT YOU WILL NEED

Expert with EKS, Kubernetes and AWS including IAM, autoscaling, networking and load balancing / request routing

Proven experience with solving scalability problems both up and down including thundering herd scenarios

Expert with troubleshooting and root cause analysis

Expert-level experience with Python

Strong analytical skills

Strong communication skills, both verbal and written

Proven experience with building deployment pipelines and enabling self-service

Strong teamwork and willingness to collaborate with others

Proven experience with training and mentoring engineers

NICE TO HAVE, BUT NOT A DEALBREAKER

BS or equivalent

AWS Solutions Architect Professional certification

SCHEDULE / SHIFT

Coverage during NFL Sundays and other marquee live events throughout the year

NOTE: Fox Behavioral Skills / Competencies: Dependability, Initiative, Teamwork, Personal Integrity, Professionalism; Fox Leadership Skills / Competencies: Sets Clear Goals, Seeks Collaborative Solutions, Delivers Constructive Feedback, Leadership Integrity and Compliance

Learn more about Fox Tech at foxtech

Pursuant to state and local pay disclosure requirements, the pay range for this role, with final offer amount dependent on education, skills, experience, and location, is $161,500.00-$212,000.00 annually for California. This role is also eligible for an annual discretionary bonus, various benefits, including medical / dental / vision, insurance, a 401(k) plan, paid time off, and other benefits in accordance with applicable plan documents.

Benefits for Union represented employees will be in accordance with the applicable collective bargaining agreement.
