Search jobs > Elk Grove, CA > Senior site reliability

Senior Site Reliability Engineer, Insight BPR

Apple
Elk Grove
Full-time

Summary :

Do you want to be part of a group critical to the success of Apple? Are you a Senior Site Reliability Engineer who is passionate about solving hard problems, owning the entire solution and leveraging cutting edge technologies to enable business operations?

Do you enjoy creating automation to eliminate toil? Do you excel under pressure? Can you summarize highly complex problems so that others can help you solve them?

Do you have rock solid integrity and are the team member people trust and count on? Does everyone turn to you to brainstorm solutions?

Do you like gathering evidence to base your decisions off of, but can use your gut, intuition and experience to make quick decisions when necessary?

If you smile in the face of pressure, can work independently but are also great team player, we're looking for you! We are seeking an experienced and highly skilled Senior Site Reliability Engineer (SRE) to join our dynamic team.

As a Senior SRE, you will play a crucial role in ensuring the reliability, scalability, and performance of our systems and applications.

The ideal candidate will have a deep understanding of cloud technologies, strong problem-solving skills, and a proven track record of implementing and maintaining robust infrastructure.

Key Qualifications :

Have a passion for Site Reliability Engineering and a flexible, creative approach to problem solving.5+ years of hands-on experience with one or more programming languages : Java, Python, Node, Go or RubyFull-stack experience.

Frontends using Python or Javascript along with frameworks (Flask, ReactJS, Angular, etc) as well as backends using different stacks (PHP Symphony, NodeJS, Express, etc).

Demonstrated experience with relational databases : MySQL, Postgres, etc3+ years of hands-on experience with KubernetesExperienced professional with a deep experience with cloud providers such as AWS or GCPExperience with at least one of these monitoring systems : AppDynamics, Grafana, Kibana, Prometheus, InfluxDBExperience with build automation, source control and CI / CD tools (ArgoCD, GitHub, Artifactory, Jenkins, Spinnaker, etc)Linux configuration, deployment and troubleshootingExcellent problem solving and programming skills;

proven technical leadership and communication skillsFlexibility for travel and work schedules

Description :

Shape the next generation of big data solutions by working on the bleeding-edge technologies and solutions for the Insight BPR team.

Insight BPR is looking for exceptional engineers to help run, optimize and scale our environment to the next level. Be a member of the team that is responsible for the data collection and reporting for all of Apple’s products around the world.

You will operate and scale systems that every iPhone, iPad and Mac have interacted with. Apple’s engineering and operations teams will utilize your systems to build the next insanely great product.

In this role, you will be working with very large-scale, highly-available Big Data platform supporting multi-Petabytes of data with super-linear growth.

You must have a build-to-manage , problem-solving and innovative mindset applied to the design, build, test, deploy, change and maintenance of enterprise class applications drawing from deep engineering expertise.

Key measures of success will include platform stability, effective integration and delivery, instrumentation, release quality, technical debt(toil) reduction, development of automation, risk / security compliance, and sustained advancement of the SRE practice.

As a member of a cross-functional team, you'll have the opportunity to solve challenging big data engineering problems across a broad range of Apple manufacturing services.

You will have hands on experience operating and managing very large scale systems.

Additional Requirements :

Cloud infrastructure as code experience, e.g., Crossplane, Pulumi, Terraform, CloudFormation, etc.Experience with Open API and Microservice architectureExperience with configuration management tools such as : Ansible, Chef, Puppet, SaltExperience in helping to define service agreements such as : Error budgets, SLOs, SLIs and SLAsExcellent problem solving and programming skills;

proven technical leadership and communication skillsAbility to learn new technologies quicklyExperience with Kafka, Elastic, Druid, Object Storage a strong plus

30+ days ago
Related jobs
Promoted
Apple
Elk Grove, California

Do you want to be part of a group critical to the success of Apple? Are you a Senior Site Reliability Engineer who is passionate about solving hard problems, owning the entire solution and leveraging cutting edge technologies to enable business operations? Do you enjoy creating automation to elimina...

Apple
Elk Grove, California

Do you want to be part of a group critical to the success of Apple? Are you a Senior Site Reliability Engineer who is passionate about solving hard problems, owning the entire solution and leveraging cutting edge technologies to enable business operations? Do you enjoy creating automation to elimina...

CoStar Group
CA, Orange County

On-site fitness center and/or reimbursed fitness center membership costs (location dependent), with yoga studio, Pelotons, personal training, group exercise classes, as well as Segways and bikes available for use during the day. ...

E-Solutions
California, United States

Site Reliability Engineer (SRE). We are seeking a skilled Site Reliability Engineer (SRE) to join our dynamic team. You will be responsible for ensuring the availability and reliability of our SaaS products, which host customer data and require 24x7 uptime. Ensure the reliability, availability, and ...

Robert Half
CA, United States

Currently, I have a client that is in the entertainment industry is looking for a Site Reliability Engineer to join their engineering team. The Site Reliability Engineer position is fully remote. The Site Reliability Engineer should have experience in AWS, Terraform, Kubernetes, and Linux. The tasks...

Splunk Inc
California, United States

You will partner with senior engineers to solve difficult problems. Learn more aboutSplunkcareers and how you can become a part of our journey!Role:Splunk is looking for a TechOps Engineer with the ability to provide day-to-day technical expertise for our Splunk Cloud Azure TechOps team and the Splu...

Divergent
CA, United States

We are seeking a driven and adept Engineer focused on the development and enhancement of processes related to Additive Manufacturing (AM) equipment, with a specific emphasis on laser powder bed fusion (LPBF). Develop and track maintenance and reliability metrics. Bachelor’s degree in Mechanical Engi...

PEAK Technical Staffing
Local Remote, CA
Remote

This SRE role will focus on providing direct, level one and two support to internal engineering teams. Engage directly with engineering customers on troubleshooting requests and guiding them on solutions. Hands on experience in working with distributed systems and availability, reliability, scalabil...

Splunk Inc
California, United States
Remote

Site Reliability Engineers in this role will be engaging with multiple service owners across the platform to teach and implement modern interpretations ofSRE,observability, Chaos Engineering andDevOps. Splunk's Cloud Services group is looking for a Site ReliabilityEngineer to help lead, design and b...

Fractal
California

As a Site Reliability Engineer with Fractal, you will be dedicated to ensuring the highest system availability and performance levels. You'll need to be onsite or have the ability to move. You will work closely with our Services and Engineering teams, playing a crucial role in optimizing our platfor...