Senior Deep Learning Scientist, LLM and Tools

Nvidia Corporation
Santa Clara, California, US
$148K-$276K a year
Full-time

Widely considered to be one of the technology world’s most desirable employers, NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence and Visualization.

The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services.

GPU deep learning ignited modern AI, the next era of computing, with the GPU acting as the brain of computers, robots, autonomous cars, and conversational AI that can perceive and understand the world.

Today, we are increasingly known as 'the AI computing company.' We're looking to grow our company and build our teams with the hardest-working people in the world.

Join us at the forefront of technological advancement.

NVIDIA is looking for a Senior Deep Learning Scientist, LLM and Tools, to develop high-impact, high-visibility large language model products and improve the experience of millions of customers using our NeMo LLM MLOps platform.

If you're creative & passionate about solving real-world conversational AI problems, come join our Digital Human LLM team.

What you’ll be doing:

  • Develop, train, fine-tune, and deploy multimodal large language models for retrieval-augmented generation and tool use.
  • Build an LLM agent framework for reasoning and action prediction in a multimodal environment.
  • Apply instruction tuning, reinforcement learning from human feedback (RLHF), and parameter-efficient fine-tuning methods such as p-tuning, adapters, and LoRA to improve LLMs for reasoning and action prediction.
  • Measure and benchmark model and application performance.
  • Analyze model accuracy and bias and recommend the next course of action & improvements.
  • Maintain model evaluation systems.
  • Drive the gathering, building, and annotation of domain-specific datasets to train LLMs for different tasks, tools, and applications.
  • Characterize performance and quality metrics across platforms for various AI and system components.
  • Participate in developing and reviewing code, design documents, use case reviews, and test plan reviews.
  • Help innovate, identify problems, recommend solutions, and perform triage in a collaborative team environment, and work with various teams on new product features and improvements to existing products.

What we need to see:

  • Master’s degree (or equivalent experience) or PhD in Computer Science, Electrical Engineering, Artificial Intelligence, or Applied Math with 5+ years of experience.
  • Understanding of LLM agent development approaches for multi-step planning, reasoning, and tool interaction.
  • Hands-on experience with LLM agent frameworks including OpenAI functions, AutoGPT, BabyAGI, and Plan-and-execute agents.
  • Experience developing production LLM-powered applications and tools with a natural language interface.
  • Excellent programming skills in Python with strong fundamentals in programming, optimizations and software design.
  • Solid understanding of ML / DL techniques, algorithms and tools with exposure to CNN, RNN (LSTM), Transformers (BERT, BART, GPT / T5, Megatron, LLMs).
  • Experience training BERT, GPT, and Megatron models for different NLP and dialog system applications using the PyTorch deep learning framework, and performing NLP data wrangling and tokenization.
  • Understanding of the MLOps life cycle and experience with MLOps workflows, traceability, and versioning of datasets, including know-how of database management and queries (in SQL, MongoDB, etc.).
  • Experience using end-to-end MLOps platforms such as Kubeflow, MLflow, and Airflow.
  • Strong collaborative and interpersonal skills, specifically a proven ability to effectively guide and influence within a dynamic matrix environment.

Ways to stand out from the crowd:

  • Fluency in a non-English language - Spanish / Mandarin / German / Japanese / Russian / French / UK English / Arabic / Korean / Italian / Portuguese.
  • Familiarity with GPU-based technologies like CUDA, cuDNN, and TensorRT.
  • Background with Docker and Kubernetes, and strong C++ programming skills.
  • Background with deploying machine learning models on data center, cloud, and embedded systems.
  • Experience developing document extraction for different document types and sources and indexing at scale, as well as experience adapting LLMs to different domains such as automotive, health care, and finance.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers.

We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our best-in-class engineering teams are rapidly growing.

If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you!

The base salary range is 148,000 USD - 276,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits.

NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
