AIML - Sr Machine Learning Performance Engineer, Siri and Information Intelligence

Apple Inc.
Cupertino, California, US
Full-time
We are sorry. The job offer you are looking for is no longer available.

AIML - Sr Machine Learning Performance Engineer, Siri and Information Intelligence

The Siri team in the AIML group at Apple is seeking an exceptional Machine Learning Engineer to lead efforts in identifying bottlenecks and optimizing our model inference stack.

In this highly collaborative role, you will be at the center of multiple initiatives to accelerate and optimize LLMs and other ML models used by Siri.

This position involves consulting with multiple product teams to determine the appropriate foundation model (On Device vs Server) for their use cases and to help them achieve their accuracy and performance targets.

Your work will directly impact Siri's performance and efficiency, enhancing the overall user experience. You will be expected to bring innovative ideas and a problem-solving mindset to tackle the unique challenges associated with optimizing complex ML models.

Read on to find out what you will need to succeed in this position, including skills, qualifications, and experience.

Description

As a Machine Learning Performance Engineer, you will play a critical role in ensuring the efficiency and scalability of Siri's machine learning models.

You will work closely with diverse teams to diagnose performance issues and develop innovative solutions that enhance model performance.

Your expertise will be pivotal in shaping the future of Siri's AI capabilities.

  • Analyze and optimize the performance of machine learning models and systems used by Siri.
  • Develop and implement strategies for model tuning, parameter optimization, and efficient resource usage.
  • Conduct performance benchmarking and develop tooling and metrics to measure model performance in terms of compute, memory, and latency.
  • Collaborate with feature and product teams to consult on modeling decisions to achieve Siri performance objectives.
  • Collaborate with hardware and software teams to integrate research findings into product implementation.

Minimum Qualifications

  • Strong understanding of Transformer and LLM architectures.
  • Strong understanding of Operating System, Compiler, and Computer Architecture fundamentals. Expertise in optimizing software to take advantage of underlying hardware architecture.
  • Experience in analyzing, identifying, and optimizing performance bottlenecks.

Preferred Qualifications

  • Strong plus if you have expertise in optimizing model architectures for on-device inference.
  • Strong plus if you have previously worked with modeling pipeline teams in model deployment and promotion pipelines.
  • Creative, collaborative, and product-focused.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics.

Learn more about your EEO rights as an applicant.

J-18808-Ljbffr

13 days ago
Related jobs
Promoted
Pinterest
Palo Alto, California

In this role, you will be responsible for developing and executing a vision for the evolution of the machine learning technology stack for the conversion modeling team. You will work on tackling new challenges such as user sequence modeling, embedding features, model quantization, utility alignment,...

Promoted
TikTok
San Jose, California

Build highly scalable machine learning systems and state-of-the-art machine learning models to improve ads ranking quality and optimize advertisers' marketing strategies. Understand ads platform objectives and take full advantage of modern machine learning to improve ads relevance, quality, and quan...

Promoted
TikTok
San Jose, California

The team is made up of machine learning researchers and engineers, who support and innovate on production recommendation models and drive product impact. Experience in one or more of the following areas: applied machine learning, machine learning infrastructure, large-scale recommendation systems, m...

Promoted
TikTok
Mountain View, California

Optimize video classification, multimodal content mining, and multimodal content understanding of e-commerce short videos and live broadcasts, and optimize the e-commerce short video shopping experience. Solid foundation in data structures/algorithms, proficient in machine learning/deep learning the...

Promoted
Acceler8 Talent
CA, United States

Keywords: Research Engineer, Pretraining, AI Studio, Language Models, Fine-Tuning, Transformer Models, Deep Learning, PyTorch, Distributed Training, Horovod, DeepSpeed, Large-Scale Training, Compute Resources, Innovation, User Feedback, Vertical Integration, AI Development, Model Architecture, Engin...

Promoted
DaVita Inc.
San Jose, California

Documentation and Best Practices: Establish an effective process for machine learning and security operations, and maintain clear documentation of models, data pipelines, and security procedures. Continuous Learning and Adaptation: Commitment to staying abreast of the latest research and trends in d...

Promoted
TikTok
San Jose, California

Our company benefits are designed to convey company culture and values, to create an efficient and inspiring work environment, and to support our employees to give their best in both work and life. Governance and Experience team of e-commerce are responsible for ensuring the safety and trustworthine...

Karkidi
Palo Alto, California

As a remote Senior Machine Learning Engineer, you'll report to our Engineering Manager, Machine Learning and work with machine learning PhDs from top schools (e. Help build new machine learning prediction delivery systems - all of our products are built from the ground up with machine learning at th...

Annapurna Labs (U.S.) Inc.
Cupertino, California

SDM of Software Development for the Machine Learning Distributed Training, Core Technologies and Infra org, you will be responsible for leading a strong teams of software engineers and managers to help design and deploy a software that enables ML workloads work seamlessly on these new products. We h...

Bytedance
San Jose, California

Development of machine learning systems, including key computing development, task scheduling, and machine learning system management and operation. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Do...