Search jobs > San Jose, CA > Principal ml engineer

Principal ML Performance Engineer

AMD
San Jose, CA, US
Full-time

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences the building blocks for the data center, artificial intelligence, PCs, gaming and embedded.

Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges.

We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

AMD together we advance

THE ROLE :

We seek a Principal Machine Learning Performance Engineer to focus on ML Performance modeling, projection, and optimization for various ML workloads, and participate in hardware and software co-design.

You will focus on the interaction between ML workloads and hardware architecture, including modeling workloads such as generative AI models on multiple HW configurations and summarize your recommendation.

Furthermore, you will be working with customers and business units on perf projection, analysis, and come up with novel solutions to satisfy customer needs.

If you are passionate about performance optimization, getting the best out of the HW, and shaping the future AI acceleration, then this role is for you.

THE PERSON :

As an ML Performance Engineer, you will analyze and explore recent ML models, understand their compute and memory requirements, and provide projection on various compute hardware for both inference and training.

In addition, you will of profile and analyze various workloads on current hardware and come up with new ways to improve their performance.

The ideal candidate will have strong experience with ML hardware architecture, software optimization and performance modeling.

KEY RESPONSIBILITIES :

  • Performance modeling and analysis of ML training and inference workloads across a single and multiple accelerators. In addition of exploring various tradeoff and design decision.
  • Participate in hardware-software co-design for future hardware optimization on various ML workloads.
  • Communicate and present the results of the performance analysis and modeling to stakeholders and provide a concrete recommendation.
  • Develop and improve our framework, tools and infrastructure for performance estimation, modeling and reporting.
  • Cross team collaboration.

PREFERRED EXPERIENCE :

  • Strong technical expertise and experience in performance analysis, projection, and ML hardware architecture.
  • Experience with software optimization and performance modeling, including modeling workloads such as generative AI models on multiple HW configurations.
  • Excellent written, verbal, and presentation skills.

CREDENTIALS :

A PhD or master's degree, plus equivalent experience in computer science, electrical engineer, or a related field.

LOCATION :

San Jose or Seattle; other US locations may be considered.

LI-MV1

LI-HYBRID

At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position.

You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD's Employee Stock Purchase Plan.

You'll also be eligible for competitive benefits described in more detail here.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and / or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.

We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.

20 days ago
Related jobs
Promoted
Acceler8 Talent
CA, United States

Join Our Team as an ML Performance Engineer. As an ML Performance Engineer at our company, you'll have the opportunity to:. As an ML Performance Engineer you will:. Profile and enhance the performance of ML workloads across various platforms, such as Nvidia, Apple, and Qualcomm. ...

Advanced Micro Devices, Inc
San Jose, California

AMD together we advance_ THE ROLE: We seek a Principal Machine Learning Performance Engineer to focus on ML Performance modeling, projection, and optimization for various ML workloads, and participate in hardware and software co-design. THE ROLE: We seek a Principal Machine Learning Performance Engi...

Cisco
San Jose, California

As a Principal Engineer, you will have the incredible opportunity to be at the forefront of artificial intelligence, machine learning, and deep learning research and development. We are looking for a passionate AI/Data Science Principal Engineer to join us on the sprint that takes us from ideas to p...

Cisco Systems, Inc.
San Jose, California

As a Principal Engineer, you will have the incredible opportunity to be at the forefront of artificial intelligence, machine learning, and deep learning research and development. We are looking for a passionate AI/Data Science Principal Engineer to join us on the sprint that takes us from ideas to p...

Waymo
Mountain View, California

To achieve our mission, we architect and create high-performance custom silicon; we develop system-level compute architectures that push the boundaries of performance, power, and latency; and we collaborate closely with many other teammates to ensure we design and optimize hardware and software for ...

NetApp
San Jose, California

You will work closely with architects across various engineering development teams to drive performance agenda, establish performance direction, guide performance team's technical work across all projects. Performance engineers focus on performance analysis and improvement for new products and featu...

NVIDIA
Santa Clara, California

This team focuses on optimizing efficiency and resiliency of ML workloads, as well. Build tools and frameworks that provide real time application performance metrics that can be correlated with system metrics . Collaborate with software teams to pinpoint performance bottlenecks. Design, prototy...

Waymo
Mountain View, California

Senior Software Engineer, ML Performance. Report into the TLM of ML performance. Prior experience optimizing ML model performance e. Profile and debug model performance bottlenecks. ...

NVIDIA
Santa Clara, California

This includes performing in-depth analysis and optimization to ensure the best possible performance on the current and future generations of NVIDIA CPUs. Influence the design of NVIDIA next-generation architectures and software stack by investigating the impact on application performance and develop...

Advanced Micro Devices, Inc
San Jose, California

AMD together we advance_ THE ROLE: In this team you will be building the compiler technology used to accelerate the latest AI models on AMD CPUs addressing the areas such as vision models, speech recognition, working with the leading engineers in AMD’s CPU, GPU and Adaptable Compute teams THE PERSON...