
Senior Site Reliability Engineer, Insight BPR

Apple
Elk Grove
Full-time

Summary:

Do you want to be part of a group critical to the success of Apple? Are you a Senior Site Reliability Engineer who is passionate about solving hard problems, owning the entire solution and leveraging cutting-edge technologies to enable business operations?

Do you enjoy creating automation to eliminate toil? Do you excel under pressure? Can you summarize highly complex problems so that others can help you solve them?

Do you have rock-solid integrity, and are you the team member people trust and count on? Does everyone turn to you to brainstorm solutions?

Do you like to base your decisions on evidence, yet can rely on your gut, intuition and experience to make quick decisions when necessary?

If you smile in the face of pressure and can work independently but are also a great team player, we're looking for you! We are seeking an experienced and highly skilled Senior Site Reliability Engineer (SRE) to join our dynamic team.

As a Senior SRE, you will play a crucial role in ensuring the reliability, scalability, and performance of our systems and applications.

The ideal candidate will have a deep understanding of cloud technologies, strong problem-solving skills, and a proven track record of implementing and maintaining robust infrastructure.

Key Qualifications:

Have a passion for Site Reliability Engineering and a flexible, creative approach to problem solving.
5+ years of hands-on experience with one or more programming languages: Java, Python, Node, Go or Ruby
Full-stack experience: frontends using Python or JavaScript along with frameworks (Flask, ReactJS, Angular, etc.) as well as backends using different stacks (PHP Symfony, NodeJS, Express, etc.)
Demonstrated experience with relational databases: MySQL, Postgres, etc.
3+ years of hands-on experience with Kubernetes
Deep experience with cloud providers such as AWS or GCP
Experience with at least one of these monitoring systems: AppDynamics, Grafana, Kibana, Prometheus, InfluxDB
Experience with build automation, source control and CI/CD tools (ArgoCD, GitHub, Artifactory, Jenkins, Spinnaker, etc.)
Linux configuration, deployment and troubleshooting
Excellent problem solving and programming skills; proven technical leadership and communication skills
Flexibility for travel and work schedules

Description:

Shape the next generation of big data solutions by working on bleeding-edge technologies and solutions for the Insight BPR team.

Insight BPR is looking for exceptional engineers to help run, optimize and scale our environment to the next level. Be a member of the team that is responsible for the data collection and reporting for all of Apple’s products around the world.

You will operate and scale systems that every iPhone, iPad and Mac has interacted with. Apple’s engineering and operations teams will utilize your systems to build the next insanely great product.

In this role, you will be working with a very large-scale, highly available big data platform supporting multiple petabytes of data with super-linear growth.

You must have a build-to-manage, problem-solving and innovative mindset applied to the design, build, test, deployment, change and maintenance of enterprise-class applications, drawing from deep engineering expertise.

Key measures of success will include platform stability, effective integration and delivery, instrumentation, release quality, technical debt (toil) reduction, development of automation, risk/security compliance, and sustained advancement of the SRE practice.

As a member of a cross-functional team, you'll have the opportunity to solve challenging big data engineering problems across a broad range of Apple manufacturing services.

You will have hands-on experience operating and managing very large-scale systems.

Additional Requirements:

Cloud infrastructure-as-code experience, e.g., Crossplane, Pulumi, Terraform, CloudFormation, etc.
Experience with Open API and microservice architecture
Experience with configuration management tools such as Ansible, Chef, Puppet, Salt
Experience in helping to define service agreements such as error budgets, SLOs, SLIs and SLAs
Excellent problem solving and programming skills; proven technical leadership and communication skills
Ability to learn new technologies quickly
Experience with Kafka, Elastic, Druid, Object Storage a strong plus

30+ days ago