Search jobs > Santa Clara, CA > Permanent > Senior engineer infrastructure

Senior Systems Full Stack Engineer - AI Infrastructure

NVIDIA
Santa Clara, California, US
$144K-$270.3K a year
Permanent

NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence, and Visualization.

The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services.

Our work opens up new universes to explore, enables outstanding creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.

NVIDIA is hiring engineers to scale up its AI Infrastructure. We expect you to have strong programming skills, a deep understanding of software building and packaging, especially related to GO and Python, experience with distributed systems, as well as excellent communication and planning skills.

We also welcome out-of-the-box thinkers who can provide new ideas while being strong at executing tasks. Expect to be constantly challenged, improving and evolving for the better.

The full job description covers all associated skills, previous experience, and any qualifications that applicants are expected to have.

You and other engineers in this team will help advance NVIDIA's capacity to build and deploy leading infrastructure solutions for a broad range of AI-based applications that affect core data science.

If you're creative, passionate about what you do, autonomous, and love having fun, then what are you waiting for? Apply today!

What You'll Be Doing

  • Building full stack services to analyze the workloads and applications with the goal of improving the SW and infrastructure efficiency.
  • Understand, analyze, profile, and optimize infrastructure supporting deep learning and HPC applications.
  • Build tools to automate workload analysis, workload optimization, and other critical workflows.
  • Extract customer needs and define technical requirements pertaining to hybrid platforms spanning on premise and CSPs.
  • Understand, analyze, and optimize API communications with various products and across various media (HTTP / REST, gRPC, ssh).
  • Collaborate with other product owners to understand and design for their needs, and with service providers to express requirements.
  • Implement production-quality software across NVIDIA's deep learning platform stack.
  • Work with engineering teams across all of NVIDIA to ensure your software integrates seamlessly up and down the stack.

What We Need To See

  • Masters in CS, EE, or CSEE or equivalent experience.
  • 4+ years of job experience.
  • Wide experience with Linux based operating systems.
  • High-level knowledge of a mainstream programming language (Go / Python / JavaScript) as well as the ability to pick up new ones.
  • Understanding of performance, security, and reliability in complex distributed systems.
  • Experience with Git and hosted-GIT services (GitLab, etc.).
  • General web networking knowledge (HTTP, REST, gRPC).
  • A solid understanding of micro-service Architecture and RESTful APIs. Experience with unit test tools and code coverage tools.
  • Excellent data analysis skills and demonstrated ability to solve sophisticated issues involving multiple software or hardware components.

Ways To Stand Out From The Crowd

  • Background with Apache Airflow.
  • Experience with next.js, next.auth, and react-based SPAs, server-sent events. Experience with large micro-service systems.
  • Experience implementing GitLab CI pipelines. Experience with async programming.

The base salary range is $144,000 - $270,250. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

J-18808-Ljbffr

3 days ago
Related jobs
Promoted
Intone Networks
Sunnyvale, California

Experience in creating and maintaining data processing workflows with tools including Airflow or Oozie. Apply now, read the job details by scrolling down Double check you have the necessary skills before sending an application. Here’s how the job details align with your profile. He is looking for so...

Promoted
Waymo
Mountain View, California

The TaaS (Transportation as a Service) and Ride-Hailing Infrastructure team builds mission-critical backend systems, and on-vehicle software, that allows Waymo's autonomous vehicles to operate at tremendous commercial scale. The Waymo Driver powers Waymo One, a fully autonomous ride-hailing service,...

Promoted
Cisco Systems, Inc.
San Jose, California

The networking industry is going through a massive transformation to build the next generation infrastructure to meet the needs of AI/ML workloads and continuously increasing internet users and application. You will be working among engineers who are passionate about tackling complex technology, bui...

Promoted
Samsung Electronics GmbH
San Jose, California

By working together as a dedicated and passionate team, we aim to revolutionize the way AI/ML applications are deployed and executed, ultimately contributing to the advancement of AGI in an affordable and sustainable manner. Collaborate with hardware and software engineers to ensure that the systems...

Promoted
Samsung SDS America
San Jose, California

Samsung SDS America is looking for a Senior Systems Administrator with a strong background in system administration, infrastructure automation, and network management to join our Device Solutions America (DSA) Lab Support team to support Samsung Semiconductor. The ideal candidate will have hands-on ...

Promoted
Samsung Electronics GmbH
San Jose, California

Lead infrastructure projects; participate in the company’s application development projects as a systems specialist and solution provider, with the overall aim of providing well engineered, secure, and integrated solutions. Senior Engineer, IT Infrastructure Architecture. Job Title Senior Engineer, ...

Promoted
Walmart
Sunnyvale, California

As Senior Manager, Software Engineering in Data Ventures, you will be leading a team of 10+ engineers for Core Services / Platform engineering. Option 1: Bachelor's degree in computer science, computer engineering, computer information systems, software engineering, or related area and 5 years’ expe...

Promoted
Nuro, Inc.
Mountain View, California

Extensive experience (5+ years) developing production-quality software in C/C++ and Python for complex systems in areas such as self-driving, robotics, embedded Linux systems, telecommunications, or other technology industries. Read on to fully understand what this job requires in terms of skills an...

Mediabistro
Campbell, California

Required Skills Proven experience as a full stack engineer with an emphasis on troubleshooting, pragmatic problem solving, and using industry-standard technologies Javascript/Typescript experience Experience in a modern frontend framework such as React or Vue (React is preferred) Backend development...

Couchbase
Santa Clara, California

As a Senior Software Engineer on the AI team, your job will be to design and implement cutting-edge database and AI features and tools using the latest techniques to evolve Couchbase products and Capella service. We have a world-class engineering team that is working not only on bringing AI to Couch...