Sr ML Training GPU Optimization Engineer

Adobe

San Jose

$170.9K-$325.2K a year

Full-time

Our Company

Changing the world through digital experiences is what Adobe’s all about. We give everyone from emerging artists to global brands everything they need to design and deliver exceptional digital experiences! We’re passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen.

We’re on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity.

We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours!

What you will be working on :

Write efficient forward and backward passes in CUDA / CuTe.
Write Optimized custom layers in Pytorch.
Optimize ML training code for large, distributed training with FP8.
Quality and performance analysis between data types such as BF16 and FP8 for large deep learning models.
Understand and optimize H100 GPUs.
Architect broader, end to end optimized training code and schemes with Pytorch for large distributed models.
Write high quality, product level code that is easy to maintain and test following standard methodologies.

What do you need to succeed :

Proficiency in at least two of : Linux, Ansible, Docker, Kubernetes (5+ yrs)
Expert in Python and or C++
Expert in CUDA / CuTe, OpenCL and or Triton
Expert in Pytorch
Experience with DDP, FSDP
Experience in distributed computing (7+ yrs)
Experience working with AWS or similar cloud infrastructure (5+ yrs)
Experience with HW resource management for ML training and / or deployment
or in Computer Science, Computer Engineering or a related area

FireflyGenAI

Our compensation reflects the cost of labor across several geographic markets, and we pay differently based on those defined markets.

The pay range for this position is $170,900 $325,200 annually. Pay within this range varies by work location and may also depend on job-related knowledge, skills, and experience.

Your recruiter can share more about the specific salary range for the job location during the hiring process.

At Adobe, for sales roles starting salaries are expressed as total target compensation (TTC base + commission), and short-term incentives are in the form of sales commission plans.

Non-sales roles starting salaries are expressed as base salary and short-term incentives are in the form of the Annual Incentive Plan (AIP).

In addition, certain roles may be eligible for long-term incentives in the form of a new hire equity award.

30+ days ago

Related jobs

Promoted

Sr. Staff GPU Graphics Virtualization Software Engineer

NIO

CA, United States

Partner with other engineering teams to understand real-world constraints and to support the high-quality implementation of GPU virtualization for vehicle product SW development, validation and integration. BS / MS in Electrical Engineering, Computer Engineering, Computer Science or equivalent. Arch...

Promoted

CAD ML Timing Optimization Engineer

Apple

Sunnyvale, California

As a CAD ML Timing Optimization Engineer, you will: - Deliver methodology and tool solutions for static timing closure and power optimization. Combining machine learning algorithm application with practical design know-how and software engineering best practices, you will help to differentiate and s...

Promoted

Sr. Data Scientist / ML Engineer - REF8522U

Zscaler

San Jose, California

Machine Learning Engineer or Data Scientist. Experience with feature engineering, model evaluation and model error analysis. Master's Degree in Computer Science/Engineering required, data science concentration is a plus; PhD is preferred. Passion for leveraging ML/AI to solve real-world business pro...

Promoted

Senior GPU Optimization Engineer

Adobe

San Jose, California

We are hiring for a highly strategic and visible role to apply GPU optimization skills towards improving the training efficiency and performance of these models. Leverage FP8 to accelerate training and inference. Optimize model efficiency for Hopper/Blackwell GPU architectures. Computer Science, Com...

Promoted

Sr. System Engineer (GPU/Server)

Supermicro

San Jose, California

Understanding of GPU technology and industry offerings such as Nvidia Tesla and AMD GPUs is desirable. We seek talented, passionate, and committed engineers, technologists, and business leaders to join us. Senior System Product Engineer. This individual will also be the go-to person for product mana...

Promoted

Application Software Optimization Engineer, ML/AI

CV Library

Santa Clara, California

Responsibilities:Application Software Optimization Engineer, ML/AI. This position is for a senior level application optimization engineer in AI, with a focus on optimizing Machine Learning applications. AMD's Data Center GPU organization is transforming the industry with our AI-based Graphic Process...

Promoted

AIML - Sr Software Development Engineer in Test, ML Systems Evaluation Engineering

Apple Inc.

Cupertino, California

Join ML Systems Evaluation Engineering (MLSEE) team at Apple and contribute to a highly accomplished team that evaluates AIML products, that will delight and inspire billions of people!We are looking for Software Development Engineer in Test with a strong background and experience in Machine-Learnin...

Sr. Staff Engineer, GPU Performance Modeling

SAMSUNG

San Jose, California

GPU Performance Modeling Engineer, you will work as part of the GPU Architecture team where you will develop performance models and triage to fix performance-related issues for a highly efficient mobile GPU. You identify architecture, micro-architecture, and implementation optimizations to improve t...

Software Engineer- AI/ML, AWS Neuron Distributed Training

Annapurna Labs (U.S.) Inc.

Cupertino, California

The ML Distributed Training team works side by side with chip architects, compiler engineers and runtime engineers to create , build and tune distributed training solutions with Trn1. This role is for a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. Pref...

Sr GPU Architectural Modeling Engineer

AMD

Santa Clara, California

Knowledge of the GPU hardware and the efficient use of discrete GPUs, APUs, and mobile devices is an advantage. The tool is utilized to predict the performance of the next-gen product across different GPU families at AMD. Data from the AM model is used by GPU architects to identify areas that can be...

Sr ML Training GPU Optimization Engineer

Sr. Staff GPU Graphics Virtualization Software Engineer

CAD ML Timing Optimization Engineer

Sr. Data Scientist / ML Engineer - REF8522U

Senior GPU Optimization Engineer

Sr. System Engineer (GPU/Server)

Application Software Optimization Engineer, ML/AI

AIML - Sr Software Development Engineer in Test, ML Systems Evaluation Engineering

Sr. Staff Engineer, GPU Performance Modeling

Software Engineer- AI/ML, AWS Neuron Distributed Training

Sr GPU Architectural Modeling Engineer

Related searches