Senior Deep Learning Performance Engineer

Kuraray America, Inc.

Santa Clara, California, US

Full-time

We are now looking for a Senior Deep Learning Performance Engineer!

Apply now, read the job details by scrolling down Double check you have the necessary skills before sending an application.

Do you want to help drive the development of high-performance, power-efficient datacenter solutions for Large Language Models (LLMs)?

Do you have an interest in how system architecture across GPU, networking, CPU and IO relate to brand new generative AI capabilities?

Come join our team, and bring your experience and interests to help us optimize our next generation of datacenter hardware, deep learning frameworks and to redefine the deep learning industry once again.

What you'll be doing :

Characterize a broad range of generative AI applications running on NVIDIA datacenters, based on their system characteristics.
Find opportunities to improve efficiency across aspects ranging from CPU / GPU / System architecture to GPU Kernels and Libraries, and DL Frameworks.
Develop analysis and lightweight profiling tools and methodologies to measure key performance metrics and to estimate potential for efficiency improvement.
Work with key partners in Datacenter Design Teams, silicon IP Architecture Teams, as well as DL framework teams to evolve the balance of silicon resources for NVIDIA's next generation datacenters.

What we need to see :

A Master’s degree in Electrical Engineering, Computer Science or Computer Engineering or equivalent experience. PhD is a plus.
6+ years of relevant work experience with exposure to any one of the two.
System software development and performance analysis for parallel computing setups. Prior experience with performance aspects of Operating system intrinsics (e.

g. : Linux scheduling), GPU kernels (CUDA), or DL Frameworks (PyTorch, TensorFlow).

Silicon performance analysis of High Performance Computing or Deep Learning Systems with commonly used silicon performance monitoring and profiling tools (perf / gprof / nvidia-smi / dcgm).

In depth performance modeling experience in any one of CPU, GPU, Memory or Network Architecture is a plus.

Proficiency in programming (Python, C / C++). Exposure to Containerization Platforms (docker) and Datacenter Workload Managers (slurm) would be viewed favorably.
Knowledge of LLMs and their intrinsics is desired.
Ability to plan, own and drive involved tasks from beginning to end, and to coordinate activities between team members.

Familiarity with multi-site teams or multi-functional teams is an added advantage.

NVIDIA invented the GPU in 1999 and over the years, fueled the growth of PC gaming, redefined computer graphics, and revolutionized parallel computing.

More recently, NVIDIA has orchestrated the development and adoption of culture-changing technologies in Artificial Intelligence and Cryptography.

Combining extraordinary graphics processors with innovative CPUs and pioneering networking technologies like Infiniband, NVIDIA has broken ground and set standards of excellence in an array of important and exciting applications.

As NVIDIA leads the industry in AI, our team plays a central role in getting the most out of our exponentially growing datacenter deployments as well as establishing a data-driven approach to system design.

We collaborate with a broad cross section of teams at NVIDIA ranging from DL research teams to CUDA Kernel and DL Framework development teams, to Silicon Architecture Teams.

As our team grows, and as we seek to identify and take advantage of long term opportunities, our needs are expanding as well.

With competitive salaries and a generous benefits package, NVIDIA is widely considered to be one of the technology world’s most desirable employers.

We have some of the most forward-thinking and hardworking people in the world working with us and, due to unprecedented growth, our best-in-class engineering teams are rapidly growing.

If you're a creative and autonomous engineer with a real passion for both silicon architecture and software performance and efficiency, we want to hear from you!

The base salary range is $176,000 - $333,500. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

J-18808-Ljbffr

11 days ago

Related jobs

Promoted

Senior Deep Learning Algorithm Engineer - Agentic LLM Inference

VirtualVocations

Santa Clara, California

A company is looking for a Senior Deep Learning Algorithm Engineer - Agentic LLM Inference. ...

Promoted

AIML - Senior Machine Learning Engineer, Computer Vision, Siri and Information Intelligence

Apple

Cupertino, California

We are seeking a highly skilled Senior Machine Learning Engineer specializing in Computer Vision to join our dynamic team. As a Machine Learning Engineer within the Siri team, you will define new approaches for evaluating innovative computer vision algorithms and models to enhance product capabiliti...

Promoted

Senior Machine Learning Engineer

CARIAD, Inc.

Mountain View, California

As a Senior Machine Learning Engineer at CARIAD, Inc. Optimize machine learning models to run efficiently on compute-limited devices, such as those embedded in vehicles, enhancing performance without compromising on quality. SDV Hub, you will play a pivotal role in shaping the future of the automoti...

Promoted

Senior Machine Learning Operations Engineer

Mendel.ai

San Jose, California

Our comprehensive clinical intelligence platform, Hypercube, combines advanced machine learning with deep clinical understanding to deliver precise, actionable insights from complex data sets. We seek a highly skilled and motivated Machine Learning Engineer to join our dynamic team. In this role, yo...

Promoted

Senior Applied Deep Learning Research Scientist, multi-modal LLMs

NVIDIA

Santa Clara, California

NVIDIA is searching for expert researchers in audio, deep learning, generative models, and large language models to join our applied deep learning research team that pioneered Megatron, DLSS, and several models for Audio (BigVGAN, One TTS Aligner To Rule Them All, RADMM, ZenFlow, P-Flow). We are now...

Promoted

Senior Deep Learning Scientist, Conversational AI

The Learning Experience #351

Santa Clara, California

Senior Deep Learning Scientist, Conversational AINVIDIA. NVIDIA is leveraging AI to advance High-Performance Computing and visualization technologies, serving as the brain of modern computing systems. ...

Samsung Ads - Senior Machine Learning Engineer – Data Science

SAMSUNG

Mountain View, California

You will also work with talented engineers and top-notch machine learning researchers on exciting projects and state-of-the-art technologies. Closely work with the machine learning team to define and improve machine learning products. We are exploring the latest data mining and machine learning tech...

Senior Software Engineer, Machine Learning - Personalization & Growth

DoorDash

Sunnyvale, California

As a Senior Machine Learning Scientist, you’ll be conceptualizing, designing, implementing, and validating algorithmic improvements to the search and personalization experiences at the heart of our fast growing grocery and retail delivery business. Expertise in applied ML for Search/NLP/IR/Product K...

Senior Machine Learning Engineer - Code AI/GPT

ByteDance

San Jose, California

We are seeking a top-notch AI/ML Engineer, who is fluent in programming language semantics, to help us create the world's best development tools. Join us today, and we'll empower you to build large-scale machine learning systems, raising dev productivity to unprecedented levels with powerful AI/ML t...

Deep Learning Compiler Engineer for Ryzen AI NPU

AMD

San Jose, California

We are looking for a talented Machine Learning (ML) Compiler SW Engineer to join our growing team in the AI group and play a crucial role in developing SW toolset to deploy cutting-edge ML models on AMD's XDNA Neural Processing Units (NPU). Collaborate with architects and runtime software engineers ...

Senior Deep Learning Performance Engineer

Senior Deep Learning Algorithm Engineer - Agentic LLM Inference

AIML - Senior Machine Learning Engineer, Computer Vision, Siri and Information Intelligence

Senior Machine Learning Engineer

Senior Machine Learning Operations Engineer

Senior Applied Deep Learning Research Scientist, multi-modal LLMs

Senior Deep Learning Scientist, Conversational AI

Samsung Ads - Senior Machine Learning Engineer – Data Science

Senior Software Engineer, Machine Learning - Personalization & Growth

Senior Machine Learning Engineer - Code AI/GPT

Deep Learning Compiler Engineer for Ryzen AI NPU

Related searches