Search jobs > San Jose, CA > Principal ml engineer

Principal ML Performance Engineer

AMD
San Jose, CA, US
Full-time

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences the building blocks for the data center, artificial intelligence, PCs, gaming and embedded.

Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges.

We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

AMD together we advance

THE ROLE :

We seek a Principal Machine Learning Performance Engineer to focus on ML Performance modeling, projection, and optimization for various ML workloads, and participate in hardware and software co-design.

You will focus on the interaction between ML workloads and hardware architecture, including modeling workloads such as generative AI models on multiple HW configurations and summarize your recommendation.

Furthermore, you will be working with customers and business units on perf projection, analysis, and come up with novel solutions to satisfy customer needs.

If you are passionate about performance optimization, getting the best out of the HW, and shaping the future AI acceleration, then this role is for you.

THE PERSON :

As an ML Performance Engineer, you will analyze and explore recent ML models, understand their compute and memory requirements, and provide projection on various compute hardware for both inference and training.

In addition, you will of profile and analyze various workloads on current hardware and come up with new ways to improve their performance.

The ideal candidate will have strong experience with ML hardware architecture, software optimization and performance modeling.

KEY RESPONSIBILITIES :

  • Performance modeling and analysis of ML training and inference workloads across a single and multiple accelerators. In addition of exploring various tradeoff and design decision.
  • Participate in hardware-software co-design for future hardware optimization on various ML workloads.
  • Communicate and present the results of the performance analysis and modeling to stakeholders and provide a concrete recommendation.
  • Develop and improve our framework, tools and infrastructure for performance estimation, modeling and reporting.
  • Cross team collaboration.

PREFERRED EXPERIENCE :

  • Strong technical expertise and experience in performance analysis, projection, and ML hardware architecture.
  • Experience with software optimization and performance modeling, including modeling workloads such as generative AI models on multiple HW configurations.
  • Excellent written, verbal, and presentation skills.

CREDENTIALS :

A PhD or master's degree, plus equivalent experience in computer science, electrical engineer, or a related field.

LOCATION :

San Jose or Seattle; other US locations may be considered.

LI-MV1

LI-HYBRID

At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position.

You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD's Employee Stock Purchase Plan.

You'll also be eligible for competitive benefits described in more detail here.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and / or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.

We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.

18 days ago
Related jobs
Promoted
Advanced Micro Devices, Inc
San Jose, California

We seek a Principal Machine Learning Performance Engineer to focus on ML Performance modeling, projection, and optimization for various ML workloads, and participate in hardware and software co-design. As an ML Performance Engineer, you will analyze and explore recent ML models, understand their com...

Promoted
Acceler8 Talent
CA, United States

AI/ML Performance Engineer for LLM Acceleration. Unlike traditional approaches that generalize across all ML models, our focus is exclusively on large language models (LLMs), enabling our hardware and software to achieve unparalleled simplicity and performance. Expertise in software engineering and ...

Promoted
Qualcomm
Santa Clara, California

As a Windows Performance Developer, you'll be owning and driving analysis of complex system level performance aspects and coming up with solutions to optimize performance without impacting power consumption to deliver best in class software performance for next generation of Windows on Snapdragon de...

Promoted
Acceler8 Talent
CA, United States

Join Our Team as an ML Performance Engineer. As an ML Performance Engineer at our company, you'll have the opportunity to:. As an ML Performance Engineer you will:. Profile and enhance the performance of ML workloads across various platforms, such as Nvidia, Apple, and Qualcomm. ...

Advanced Micro Devices, Inc
San Jose, California

AMD together we advance_ THE ROLE: In this team you will be building the compiler technology used to accelerate the latest AI models on AMD CPUs addressing the areas such as vision models, speech recognition, working with the leading engineers in AMD’s CPU, GPU and Adaptable Compute teams THE PERSON...

Cisco
San Jose, California

As a Principal Engineer, you will have the incredible opportunity to be at the forefront of artificial intelligence, machine learning, and deep learning research and development. We are looking for a passionate AI/Data Science Principal Engineer to join us on the sprint that takes us from ideas to p...

Aurora
Mountain View, California

We’re searching for a Software Engineer to focus on ML Accelerators. Surface high-impact findings to relevant Engineering leadership, keeping feedback loop going to influence Aurora’s ML strategy. Work closely with our autonomy and hardware teams to understand our on-vehicle ML technology. Maintain ...

NVIDIA
Santa Clara, California

This team focuses on optimizing efficiency and resiliency of ML workloads, as well. Build tools and frameworks that provide real time application performance metrics that can be correlated with system metrics . Collaborate with software teams to pinpoint performance bottlenecks. Design, prototy...

SK HYNIX INC
San Jose, California

Principal Engineer, Principal Engineer, Array Performance and Reliability. Array Engineering, TD-Reliability, Media Integration)Activities to have world-class array reliability and performance through array process development and finding best array operating condition from initial development stage...

NetApp
San Jose, California

You will work closely with architects across various engineering development teams to drive performance agenda, establish performance direction, guide performance team's technical work across all projects. Performance engineers focus on performance analysis and improvement for new products and featu...