Talent.com
Distinguished Software Architect - Deep Learning and HPC Communications

Distinguished Software Architect - Deep Learning and HPC Communications

MediabistroSanta Clara, CA, United States
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Distinguished Software Architect - Deep Learning and HPC Communications page is loaded

Distinguished Software Architect - Deep Learning and HPC Communications

Apply locations US, CA, Santa Clara time type Full time posted on Posted 2 Days Ago job requisition id JR

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.

We are the GPU Communications Libraries and Networking team at NVIDIA. We deliver communication libraries like NCCL, NVSHMEM, UCX for Deep Learning and HPC. We are looking for a Distinguished Software Architect to help co-design our next generation data center platforms. DL and HPC applications have a huge compute demand already and run on scales which go up to tens of thousands of GPUs. The GPUs are connected with high-speed interconnects (eg. NVLink, PCIe) within a node and with high-speed networking (eg. Infiniband, Ethernet) across the nodes. Communication performance between the GPUs has a direct impact on the end-to-end application performance; and the stakes are even higher at huge scales! This is an outstanding opportunity to push the limits on the state-of-the-art and deliver platforms the world has never seen before. Are you ready to contribute to the development of innovative technologies and help realize NVIDIA's vision?

What you will be doing :

Research new communication technologies (e.g. expand the GPUDirect technology portfolio) and design new features for our communication libraries

Propose innovative solutions in HW and SW for our next-gen platforms. You will co-design these solutions with the GPU, Networking, and SW architects and ensure seamless integration with the software stacks

Inspire changes based on quantitative data coming from proof-of-concepts or detailed technical analysis / modeling

Drive the adoption of new communication technologies across application verticals

Keep up with the latest DL research and collaborate with diverse teams (internal and external), including DL researchers, and customers

What we need to see :

PHD in Computer Science, Computer Engineering or related field or strong equivalent experience; 15+ years of relevant experience in academia or the industry

Expert in following areas : HPC, parallel programming models (MPI, SHMEM), at least one communication runtime (MPI, NCCL, NVSHMEM, OpenSHMEM, UCX, UCC), computer and system architecture, GPU architecture and CUDA

Deep understanding of various aspects of high performance networking from prior work experience : network technologies (Infiniband, Ethernet), network design, network topologies, network debug and performance analysis

Strong in at least a few of these areas : ML / DL fundamentals and how they tie to communications, parallel algorithms, fault tolerance and resiliency, competitive assessments, performance analysis and optimizations for parallel applications on large clusters, developing applications using DL Frameworks (PyTorch, TensorFlow)

Programming fluency with C or C++ for systems software development

Flexibility to work and communicate effectively across different HW / SW teams and timezones

Ways to stand out from the crowd :

Industry recognized leader in HPC / DL communications with history of patents, publications and conference talks and keynotes in areas relevant to this role

Influential role in industry standards (e.g. MPI, OpenSHMEM) and open source software (e.g. PyTorch, UCX, Open MPI)

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative and autonomous, we want to hear from you!

The base salary range is 308,000 USD - 471,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits . NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs (5)

Senior Software Architect - Deep Learning and HPC Communications

locations 6 Locations time type Full time posted on Posted 3 Days Ago

Senior Software Engineer, GPU Communications and Networking

locations US, CA, Santa Clara time type Full time posted on Posted 4 Days Ago

Senior Deep Learning Architect, LLM Inference

locations US, CA, Santa Clara time type Full time posted on Posted 26 Days Ago

#J-18808-Ljbffr

serp_jobs.job_alerts.create_a_job

Software Architect • Santa Clara, CA, United States

Job_description.internal_linking.related_jobs
  • serp_jobs.job_card.promoted
Informatica IDMC Architect

Informatica IDMC Architect

VirtualVocationsHayward, California, United States
serp_jobs.job_card.full_time
A company is looking for a Technical Architect - Informatica IDMC & Cloud Data Integration.Key Responsibilities : Architect and develop scalable cloud data integration capabilities across Azure an...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
  • serp_jobs.job_card.promoted
Security Architect

Security Architect

VirtualVocationsFremont, California, United States
serp_jobs.job_card.full_time
A company is looking for a Security Architect to lead the design and evolution of security architectures for enterprise-level application modernization efforts. Key Responsibilities Lead the devel...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
AI Solutions Architect

AI Solutions Architect

VirtualVocationsFremont, California, United States
serp_jobs.job_card.full_time
A company is looking for an AI Solutions Architect to lead the design and implementation of intelligent automation and conversational AI solutions. Key Responsibilities : Lead technical discovery a...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
  • serp_jobs.job_card.new
MLOps Architect

MLOps Architect

VirtualVocationsOakland, California, United States
serp_jobs.job_card.full_time
A company is looking for an MLOps Platform Architect to lead the design and implementation of machine learning platforms. Key Responsibilities Architect and build end-to-end MLOps platforms ensuri...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
  • serp_jobs.job_card.promoted
  • serp_jobs.job_card.new
Sales Solution Architect - Analog & Mixed-Signal Platforms

Sales Solution Architect - Analog & Mixed-Signal Platforms

SynopsysSunnyvale, CA, United States
serp_jobs.job_card.full_time
At Synopsys, we drive the innovations that shape the way we live and connect.Our technology is central to the Era of Pervasive Intelligence, from self-driving cars to learning machines.We lead in c...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
  • serp_jobs.job_card.promoted
Machine Learning Engineer, Audio Perception

Machine Learning Engineer, Audio Perception

WaymoSan Francisco, CA, United States
serp_jobs.job_card.full_time
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
IAM Security Architect

IAM Security Architect

VirtualVocationsHayward, California, United States
serp_jobs.job_card.full_time
A company is looking for an IAM and Security Services Architect.Key Responsibilities Define IAM and security services architecture roadmap, standards, and reference models Architect identity sol...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
System Architect, Quantum Networking

System Architect, Quantum Networking

PsiQuantumPalo Alto, CA, United States
serp_jobs.job_card.full_time
Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
Senior Machine Learning Engineer, Audio Perception

Senior Machine Learning Engineer, Audio Perception

WaymoMountain View, CA, United States
serp_jobs.job_card.full_time
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
Senior Deep Learning Engineer

Senior Deep Learning Engineer

VirtualVocationsSan Francisco, California, United States
serp_jobs.job_card.full_time
A company is looking for a Senior Deep Learning Software Engineer, Inference and Model Optimization.Key Responsibilities Train, develop, and deploy generative AI models using the company's AI sof...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
Solution Architect - Presales

Solution Architect - Presales

Informatica LLCRedwood City, CA, United States
serp_jobs.job_card.full_time
Build Your Career at Informatica.We seek innovative thinkers who believe in the power of data to drive meaningful change. At Informatica, we welcome adventurous minds eager to solve the world's most...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Software Architect

Software Architect

VirtualVocationsFremont, California, United States
serp_jobs.job_card.full_time
A company is looking for a Software Architect.Key Responsibilities Design comprehensive technical solutions that address complex business challenges and identify architectural improvements Conti...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
Software Engineer, Machine Learning / Computer Vision

Software Engineer, Machine Learning / Computer Vision

WaymoSan Francisco, CA, United States
serp_jobs.job_card.full_time
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Presales Solution Architect

Presales Solution Architect

Informatica LLCRedwood City, CA, United States
serp_jobs.job_card.full_time
Build Your Career at Informatica.We seek innovative thinkers who believe in the power of data to drive meaningful change. At Informatica, we welcome adventurous minds eager to solve the world's most...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
System Architect, Simulations & Models

System Architect, Simulations & Models

PsiQuantumPalo Alto, CA, United States
serp_jobs.job_card.full_time
Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
Deep Learning Software Engineer

Deep Learning Software Engineer

VirtualVocationsSan Jose, California, United States
serp_jobs.job_card.full_time
A company is looking for a Deep Learning Software Engineer, Inference and Model Optimization - New College Grad 2025.Key Responsibilities Train, develop, and deploy generative AI models using the...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Senior Software Engineer, Machine Learning / Computer Vision

Senior Software Engineer, Machine Learning / Computer Vision

WaymoMountain View, CA, United States
serp_jobs.job_card.full_time
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
  • serp_jobs.job_card.new
Security Architect Engineer

Security Architect Engineer

VirtualVocationsFremont, California, United States
serp_jobs.job_card.full_time
A company is looking for a Security Architect / Engineer to design and implement secure enterprise architectures for a Department of Defense information system. Key Responsibilities Lead the design ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
  • serp_jobs.job_card.promoted
Solution Architect

Solution Architect

VirtualVocationsFremont, California, United States
serp_jobs.job_card.full_time
A company is looking for a Solution Architect.Key Responsibilities Lead initial solution design discovery sessions and overall strategy for implementation projects Design, document, and communic...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
Senior Solutions Architect

Senior Solutions Architect

VirtualVocationsFremont, California, United States
serp_jobs.job_card.full_time
A company is looking for a Senior Solution Architect (DHS).Key Responsibilities Lead the development and execution of strategic data analytics and reporting solutions Design and implement comple...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30