ASE - Site Reliability Engineer

Apple, Inc.
Cupertino, California, US
$143.1K-$264.2K a year
Full-time

Summary

Posted: Sep 19, 2024

Role Number: 200549871

The Apple Services Engineering (ASE) SRE team is looking for Site Reliability Engineers with experience in developing processes, tools, and automation for managing distributed systems in production environments.

Our SRE team combines software engineering, systems engineering, and system administration practices to build and run large-scale, massively distributed, fault-tolerant systems.

Our software ensures that Apple's services are reliable, scalable and secure, and we leverage both open source and home-grown technologies to provide managed data infrastructure services.

You will help build next-generation search infrastructure and platform services, collaborating cross-functionally with various ASE teams, from store and commerce to search and recommendations.

You'll create platforms that can rapidly scale to serve personalized and non-personalized data with very low latencies. You should be someone who is not afraid to question assumptions, is a standout colleague under tight deadlines, and can take on problems with elegant technical solutions.

Description

The ASE SRE team develops applications and tooling that are safe, reliable, scalable, and fast. This work is technically demanding and requires an innovative spirit and an extraordinary degree of care in engineering.

Team members contribute to all major components of Redis deployment infrastructure, including maintenance automation, the backup service application, monitoring and alerting tooling and dashboards, and deployment architecture, with a focus on stability, performance, and scaling.

Success in this role requires expertise in several of the following:

  • Understanding of core SRE concepts: monitoring, alerting, and incident management.
  • Understanding of database concepts (consistency models, isolation levels, crash and recovery semantics).
  • Performance engineering (design concepts, profile-guided optimization).
  • Service management across bare-metal, virtualized (EC2), and Kubernetes platforms.
  • Fundamentals of system-level hardware and networking components (storage devices and controllers, network interfaces, CPU and memory layout in server-class systems).
  • Operating systems concepts (process scheduling, disk and network I/O, performance).
  • Datacenter architecture (networking topologies, host placement strategies, and failure modes); design of multi-datacenter systems; failure domains; and wide-area networking.

This role also requires excellent communication and a high degree of customer focus when engaging with internal platform customers.

Because we are a distributed team, the ability to work effectively with colleagues based in other locations is also essential; experience in this area is a plus.

Prior experience with the development or maintenance of distributed databases or storage systems is recommended. Apple values craftsmanship, and performance is a key ingredient.

Come join us at Apple Services Engineering and help us deliver services and applications that are fluid and responsive. You will collaborate with engineers from across Apple to define metrics, set targets, uncover optimization opportunities, define quality guardrails, and ship a product or service that will delight our customers.

This role is for engineers who enjoy deep technical engineering that spans large cross-organizational projects. Your openness to learning and implementing new technologies will contribute to the continuous evolution of our organization.

Key Qualifications

  • Demonstrated expertise in developing database systems, storage engines, or distributed systems, or in performance engineering.
  • Experience developing critical internet services and/or platform infrastructure.
  • Proficiency in one or more of the following programming languages: Java, Go (golang), Python.
  • Optional: experience managing services running on Kubernetes.
  • Optional: experience with EC2, EBS, and Terraform.

Education & Experience

BS or MS in Computer Science or a related field, or equivalent work experience.

Pay & Benefits

At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role.

The base pay range for this role is between $143,100 and $264,200, and your base pay will depend on your skills, qualifications, experience, and location.

Apple employees also have the opportunity to become an Apple shareholder through participation in Apple's discretionary employee stock programs.

Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple's Employee Stock Purchase Plan.

You'll also receive benefits including comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and, for formal education related to advancing your career at Apple, reimbursement for certain educational expenses, including tuition.

Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation.

Note: Apple benefit, compensation, and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics.
