Search jobs > Cupertino, CA > Manager site reliability

Senior Manager, Data Site Reliability Engineering, Ad Platforms

Apple
Cupertino
Full-time

Summary :

At Apple, we work every day to build products that enrich people’s lives. Our Advertising Platforms group makes it possible for people around the world to easily access informative and imaginative content on their devices while helping publishers and developers promote and monetize their work.

Today, our technology and services power advertising in Search Ads in the App Store and Apple News. Our platforms are highly-performant, deployed at scale, and setting new standards for enabling effective advertising while protecting user privacy.

The Ad Platforms team is seeking a Senior Manager for leading Data Site Reliability Engineering. Our mission is to enable Ad Platforms to deliver advertisements in a reliable and scalable way that results in fantastic user experiences!

Key Qualifications :

Expert understanding in Linux based systems and deep expertise in Hadoop / YARN / Spark based technologiesHands on experience with AWS / EMR, S3, Glue, Athena and Kubernetes Infrastructure Expertise in designing, implementing and administering large Hadoop clusters and related Infrastructure such as Hive, Spark, HDFS, HBase, Oozie, Presto, Flume, Airflow and Zookeeper 5+ years managing clustered services, distributed systems, production data stores 5+ years leading teams in multiple locations Experience in managing the life cycle of data services from inception and design to deployment, operation, migration, administration and sunsets Experience in running Machine Learning pipelines (Training models, experimentation) and Jupyterhub / GPU compute / pytorch Infrastructure Cloudera CDH5 / CDH6 / CDP cluster management and prior capacity planning experience for large scale multi tenant clusters Ability to code well in at least one language (Shell, Ruby, Python, Java, Perl) Experience in setup / management of security infrastructure such as Kerberos Good work attitude and tenacious troubleshooting / analytical skills Multi-datacenter deployment / Disaster Recovery experience is a plus Prior Advertising and related data pipeline (click stream etc.

experience is a plus! A passion to reinforce and enrich an engineering team environment, driving team engagement and satisfaction and most meaningfully, a sense of humor and an eagerness to learn

Description :

Design and implement scalable data platforms for our customer facing services Monitor production, staging, test and development environments for multiple teams in an agile / dynamic fast paced engineering organization Deploy and scale Hadoop infrastructure to support data pipeline and related services Build infrastructure capabilities to improve resiliency and efficiency of the systems and services at scale Drive data infrastructure / pipeline, services and upgrade / migration projects from start to finish Support in Hadoop / HDFS infrastructure day today operations, administration and maintenance Data cluster monitoring and troubleshooting Capacity planning, management, and troubleshooting for HDFS, YARN / MapReduce and Spark work loads Participate in rotational on-call schedule Partner with program management, network engineering and other multi-functional teams on the larger initiatives Work simultaneously on multiple projects contending for your time and understand how to prioritize them accordingly Build and drive automation capabilities for the organization

Additional Requirements :

2 days ago
Related jobs
Promoted
RingCentral, Inc
Belmont, California

Lead by example, mastering both fundamental and advanced system functions from product and engineering perspectives. Lead and organize cross-team collaboration to address and resolve issues. Contact Center and AI-powered adjacencies. We invest more than $250 million annually to ensure our AI-enabled...

Promoted
Conductor
San Jose, California

The Senior Manager, Foundry Customer Engineering (CE) will manage the Samsung foundry customer's programs in various process technologies. Location: Onsite at our San Jose headquarters 3+ days a week with an average of. Is this your next job Read the full description below to find out, and do not he...

Promoted
MongoDB
Palo Alto, California

The worldwide data management software market is massive (According to IDC, the worldwide database software market, which it refers to as the database management systems software market, was forecasted to be approximately $82 billion in 2023 growing to approximately $137 billion in 2027. We are the ...

Promoted
Character.AI
Menlo Park, California

As a Multimodal Site Reliability Engineer (SRE) at Character, you will be responsible for ensuring the reliability, scalability, and performance of our app and AI multimodal services (e. AI is one of the world’s leading personal AI platforms. Given our current pace of growth and load on our systems,...

Advanced Micro Devices, Inc
Santa Clara, California

AMD together we advance_ Senior Software Engineering Manager, AI Software Solutions THE ROLE: Would you like to be part of a world class team enabling Machine Learning applications for world class datacenters and the mightiest supercomputers? AMD is searching for talented and highly motivated Softwa...

Cisco
San Jose, California

Cisco’s Cloud Security Engineering team is seeking an experienced and accomplished Engineering Leader to lead the implementation of automated tools and frameworks to help scale Cisco’s cloud security program. The successful candidate is excited to embrace a culture of innovation, demonstrate ownersh...

Palo Alto Networks
Santa Clara, California

Deep technical experts and thought leaders that help accelerate adoption of the very best engineering practices, while maintaining knowledge on industry innovations, trends and practices. Familiarity with Agile (, Scrum Process), Big Data technologies like Hive, Kafka, Hadoop, SQL, developing APIs, ...

DoorDash
Sunnyvale, California

The Data Engineering team partners with engineering organizations and stakeholders from finance, accounting and product teams to understand the data needs of the business and produce pipelines, data marts and other data solutions that enable better product and growth decision-making. Partner closely...

Apple
Cupertino, California

The Apple Services Engineering (ASE) team is one of the most exciting examples of Apple’s long-held passion for combining art and technology. Analyze logs and telemetry data by writing monitoring and automation code. Participate in on-call and release manager rotations. ...

Juniper Networks
Sunnyvale, California

The software engineering group comprises of highly skilled professionals who are responsible in delivering production grade quality release and products through agile and adaptive engineering methodologies. The role of software engineering senior manager plays an important role by leading and managi...