Talent.com
Software Manager, AI Infrastructure System
Software Manager, AI Infrastructure SystemNVIDIA • US
serp_jobs.error_messages.no_longer_accepting
Software Manager, AI Infrastructure System

Software Manager, AI Infrastructure System

NVIDIA • US
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI and enabled the next era of computing. NVIDIA is a “learning machine” that constantly evolves by adapting to new opportunities that are hard to address, that matters to the world, and that only we can address. This is our life’s work, to amplify human imagination and intelligence, and expand what is possible. We’re seeking strategic, bold, hard-working, and creative individuals who are passionate about helping us tackle challenges no one else can solve. Make the choice to join us today.

We are looking for a n AI Infrastructure System Software Manager to join our mission to continue improving our HPC infrastructure. Our team builds and operates sophisticated infrastructure to enable business critical services and AI applications. You will be working with a team of passionate and skilled engineers that are continuously working to provide better tools to build and manage this i nfras tru cture . Ideal candidate is strong in software development, designing and creating reliable distribute d system s, and has the abi lit y to imp leme n t well though t out lo ng term maintenance strategy.

What you'll be doing :

Mentor, grow, and develop a world-class team of AI infrastructure engineers.

Work across several teams and orgs to build products that use LLMs and agent systems to serve the needs of NVIDIA engineering teams. In that role, you will be collaborating with research and infra teams and serve a large user base (hardware / software teams across NVIDIA).

Align priorities across collaborators and define metrics for measuring the success of the product / team.

Develop and execute strategies for scalable, reliable, and secure AI infrastructure supporting both research and production workloads.

Ensure robust monitoring, logging, visualization, and alerting capabilities to guarantee promised uptime and operational excellence.

Architect, design, develop, and maintain infrastructure and large-scale applications for LLM-based solutions. Optimize these systems for performance, scalability, reliability, and secure data management.

Stay updated with the latest trends in AI, ML, and infrastructure, proactively seeking opportunities to integrate advancements into Nvidia’s LLM and AI infrastructure solutions.

What we need to see :

10+ overall years of industry large distributed system software development experience.

BS+ degree in CS or related / equivalent experience.

5+ years of experience managing of AI and SW development teams.

Familiarity with modern software development stacks and tools, including containerization, cloud or on-premises deployments, API integration for seamless model operation, and real-time processing frameworks.

Experience in developing and maintaining LLM or GenAI infrastructure

Excellent communication, collaboration and problem-solving skills, with a dedication to encouraging an inclusive and diverse workplace.

Hands-on experience developing large-scale distributed systems

Ways to stand out from the crowd :

Strong technical background in cloud / distributed infrastructure

Experience debugging functional and performance issues in HPC GPU clusters

Background in running and instrumenting distributed LLM training on a multi GPU HPC cluster

Experience with HPC schedulers such as Slurm

Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD for Level 3, and 272,000 USD - 425,500 USD for Level 4.

You will also be eligible for equity and benefits .

Applications for this job will be accepted at least until July 29, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

serp_jobs.job_alerts.create_a_job

Software Infrastructure • US

Job_description.internal_linking.related_jobs
Design Systems Manager

Design Systems Manager

Akaasa Technologies • United States
serp_jobs.job_card.full_time
serp_jobs.filters_job_card.quick_apply
Design Systems Manager Remote, hybrid if the candidate resides in Miami, FL • •Include a portfolio • • &...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days
CRNA - Anesthesiology job available in Parsons, Kansas

CRNA - Anesthesiology job available in Parsons, Kansas

Source Medical, LLC. • Parsons, KS, US
serp_jobs.job_card.full_time +1
CRNA opening in KansasLocated in Parsons, KS - Wichita 125m, Kansas City 150mFull-time, permanent positionSeeking BC / BECRNA needed to join busy team. Newly renovated surgery department.Surgery mix i...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Infrastructure Architect

Infrastructure Architect

Cyber Resource • US
serp_jobs.job_card.full_time
Virginia State University (VSU) is seeking an Enterprise Architect (EA) to design and diagram the institution's IT platforms, solutions, network and infrastructure in alignment with business g...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30
Store Manager Wanted (MOD)

Store Manager Wanted (MOD)

Tate Boys Tire & Service • Bartlesville, OK, US
serp_jobs.job_card.full_time
Manager on Deck (MOD) – Incredible Earning Potential + Career Growth!.Tate Boys Tire & Auto Service – Trusted Since 1936. Ready to fast-track your career in automotive leadership?.Ta...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
Remote Side Hustle Developer

Remote Side Hustle Developer

Finance Buzz • Parsons, Kansas, US
serp_jobs.filters.remote
serp_jobs.job_card.full_time +1
This position is for individuals who want to develop a side income stream while still working full time.You will test different small-scale remote opportunities, learn what works, and grow what pro...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
ForgeRock Identity Manager Architect / Engineer

ForgeRock Identity Manager Architect / Engineer

Cloud Security Services • US
serp_jobs.job_card.full_time
serp_jobs.filters_job_card.quick_apply
Hybrid Pathways is currently looking for an experienced ForgeRock Identity Management Engineer Lead for our client.Our client requires a ForgeRock Identity Management Engineer Lead to deploy ForgeR...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30
Ping Identity and Access Manager Architect (Remote)

Ping Identity and Access Manager Architect (Remote)

Cloud Security Services • (Multiple States), US
serp_jobs.filters.remote
serp_jobs.job_card.full_time
serp_jobs.filters_job_card.quick_apply
PING CERTIFICATION REQUIRED Job Title : .Ping Identity and Access Manager Architect (Remote) Location : Fully Remote Company : CTI About Us : CTI is a leading technology company specializing...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30
Restaurant Delivery - Flexible Schedule

Restaurant Delivery - Flexible Schedule

DoorDash • Chelsea, OK, United States
serp_jobs.job_card.full_time +1
DoorDash is the #1 category leader in food delivery, food pickup, and convenience store delivery in the US, trusted by millions of customers every day. As a Dasher, you’ll stay busy with a variety o...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
CNA / Certified Nurse Aide

CNA / Certified Nurse Aide

Americare Senior Living • Independence, KS, US
serp_jobs.job_card.full_time
Are you looking to have fun while making a meaningful impact in the lives of seniors?.At Americare, our RISING Team Values guide everything we do : . We are proud to make a meaningful impact in the li...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
Side Hustle Project Lead

Side Hustle Project Lead

Finance Buzz • Nowata, Oklahoma, US
serp_jobs.job_card.full_time +1
We’re offering a role for someone who wants to lead their own side-income project in their spare time.You’ll explore various proven side hustles, select the ones that fit your lifestyle, and run th...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Media Production Manager

Media Production Manager

L2 Realty, Inc • Independence, KS, US
serp_jobs.job_card.full_time
Media Production Manager .This role oversees the creation, management, and delivery of high-quality visual content designed to promote properties, agents, and enhance the brand image of our re...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
AI Infrastructure Engineer

AI Infrastructure Engineer

Match Point Solutions • United States
serp_jobs.job_card.full_time
serp_jobs.filters_job_card.quick_apply
MatchPoint Solutions is a fast-growing, young, energetic global IT-Engineering services company with clients across the US. We provide technology solutions to various clients like Uber, Robinhood, N...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days
CRNA - Anesthesiology job available in Parsons, Kansas

CRNA - Anesthesiology job available in Parsons, Kansas

Archway Physician Recruitment • Parsons, KS, US
serp_jobs.job_card.full_time +1
Newly renovated surgery department.Surgery mix includes Orthopedics, Neurosurgery, General Surgery, OB / GYN, Ophthalmology, Urology and ENT. Hospital Employee - Full-time - Permanent - Work independe...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Senior Manager, AI / ML Infra Engineering

Senior Manager, AI / ML Infra Engineering

IntelliPro Group Inc. • (Multiple States), US
serp_jobs.job_card.full_time +1
serp_jobs.filters_job_card.quick_apply
Sr Manager, AI / ML Infra Engineering Position Type : Full-Time / Permanent Location : Remote (Must be based in Bay Area, Chicago, Boston for occasional in-person collaboration) Salary Rang...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30
Manager, Engineering (AI) (Remote - US)

Manager, Engineering (AI) (Remote - US)

Jobgether • US
serp_jobs.filters.remote
serp_jobs.job_card.full_time
serp_jobs.filters_job_card.quick_apply
This position is posted by Jobgether on behalf of a partner company.We are currently looking for a.This role offers the opportunity to lead a high-performing technical team focused on delivering in...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.new
Restaurant Delivery - Sign Up and Start Earning

Restaurant Delivery - Sign Up and Start Earning

DoorDash • Chelsea, OK, United States
serp_jobs.job_card.full_time +1
DoorDash is the #1 category leader in food delivery, food pickup, and convenience store delivery in the US, trusted by millions of customers every day. As a Dasher, you’ll stay busy with a variety o...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Principal Systems Engineer – Systems Architect

Principal Systems Engineer – Systems Architect

Raytheon • US
serp_jobs.job_card.full_time
CO102 : 16800 E Centretech Pkwy,Aurora 16800 East Centretech Pkwy Building S75, Aurora, CO, 80011 USA.Person, or Immigration Status Requirements : . The ability to obtain and maintain a U.At Raytheon, ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
Restaurant Delivery - Sign Up in Minutes

Restaurant Delivery - Sign Up in Minutes

DoorDash • Chelsea, OK, United States
serp_jobs.job_card.full_time +1
DoorDash is the #1 category leader in food delivery, food pickup, and convenience store delivery in the US, trusted by millions of customers every day. As a Dasher, you’ll stay busy with a variety o...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Restaurant Delivery - Start Earning Quickly

Restaurant Delivery - Start Earning Quickly

DoorDash • Chelsea, OK, United States
serp_jobs.job_card.full_time +1
DoorDash is the #1 category leader in food delivery, food pickup, and convenience store delivery in the US, trusted by millions of customers every day. As a Dasher, you’ll stay busy with a variety o...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Case Manager

Case Manager

BrightSpring Health Services • Parsons, KS, United States
serp_jobs.job_card.full_time
Our operational team members focus on efficiently meeting the needs of our clients across various lines of business.If your passion is to ensure quality care to help our clients live their best lif...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted