Search jobs > Santa Clara, CA > Remote > Senior infrastructure

Senior Governance Infrastructure Platform Automation Engineer - NGC

NVIDIA
Santa Clara, CA, US
$164K-$310.5K a year
Remote
Full-time

Our technology has no boundaries! NVIDIA is building the world’s most groundbreaking and pioneering computing platforms. Because of our work, scientists, researchers, and engineers can advance their ideas.

At its core, our visual computing technology not only enables an outstanding computing experience, but it is also energy efficient! We pioneered a supercharged form of computing loved by the most fast-paced computer users in the world - scientists, designers, artists, and gamers.

It’s not just technology, though! It is our people, some of the brightest in the world, and our company makes NVIDIA one of the most fun, innovative, and dynamic places to work! At the center of NVIDIA are our core values, like innovation, excellence, determination, and team, that guide us to be the best we can be.

What you'll be doing :

Develop, maintain, and improve our Developer Platform and tooling to empower our developers, enable sophisticated cross-platform build systems, and deliver extraordinary infrastructure platform for Nvidia and its Customers.

Partner and cross-collaborate with developers, QA, and product teams to establish, refine, and streamline our SW and Infrastructure management, automation, and processes.

Strong System Admin experience using Infrastructure-as-code with tools such as Ansible, Puppet, Chef & Terraform.

Design and implement monitoring solutions to gain insight into applications and system health. Implement critical metrics using various analytics methods and dashboards.

Craft and develop tools needed for automating workflows. Reuse AI techniques to extract useful signals about machines and jobs from the data generated.

Take part in prototyping, crafting, and developing cloud infrastructure for NVIDIA.

Drive infrastructure resource efficiency initiatives with engineering and finance

What we need to see :

Bachelor's or Master’s degree in computer science, Software Engineering, or equivalent experience.

8+ years of overall experience.

Solid programming background in Python and / or similar scripting languages.

Experience in maintaining cloud infrastructure and highly available production environment.

Strong understanding of architectural requirements and development processes in building reliable, robust, scalable data products and pipelines.

Experience in Databases, both SQL (MySQL) and NoSQL (Elastic Search / MongoDB / Cassandra).

Proficient with configuration management tools like Ansible, Puppet, Chef, and source code management & binary repository systems like GitLab, GitHub, Artifactory etc.

Strong background with Gitlab, Jenkins, and / or other CI / CD systems.

Proficient with Kubernetes administration, dockers & virtualization. Knowledge of standard methodologies related to security.

Proficient with data analytics / visualization & monitoring tools like Kibana, Grafana, Splunk, Zabbix, Prometheus, and / or similar systems.

The base salary range is 164,000 USD - 310,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and . NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

30+ days ago
Related jobs
Promoted
VirtualVocations
Fremont, California

A company is looking for a Senior Software Engineer - Event Platform Storage. Key Responsibilities:Build and operate distributed, high-throughput, real-time data pipelinesDesign and architecture effective data pipelines using modern technologyWrite code, lead architectural decisions, and own meaning...

Promoted
CARIAD
Mountain View, California

As a Senior Staff Embedded Platform Graphics Engineer, you will be at the forefront of developing and optimizing graphics pipeline for our next-generation automotive platforms. To achieve that we are building the leading tech stack for the automotive industry and creating a unified software platform...

Promoted
VirtualVocations
Fremont, California

A company is looking for a Senior DevOps Engineer for Cloud Infrastructure. ...

Promoted
Sustainable Talent
CA, United States

Collaborate with multi-functional teams, including system engineering, software engineering, mechanical/thermal engineering, operations, data center teams, external vendors, and other partners to successfully deliver a reliable and robust platform from concept to prototype to deployments. Senior Dev...

Promoted
TikTok
San Jose, California

Our platform is built to help imaginations thrive. We are currently seeking a passionate machine learning engineer to join our team. In this role, you will collaborate with the product and engineering teams to identify opportunities and improve overall system performance and efficiency by integratin...

Promoted
Aurora Innovation
Mountain View, California

Software Engineer - Autonomy Data: Labels Platform. Apply production best practices to ensure platform reliability. ...

TikTok
San Jose, California

What You'll Do:- Collaborate closely with developers and business stakeholders to research and implement solutions (tools, platforms, frameworks, to improve test efficiency, such as automation solutions for functional, regression, and performance testing. Our platform is built to help imagination th...

MediaTek
San Jose, California

The architect will work closely with the engineering leaders and product managers to propose and architect the high-level system design for our leading products to win the market. The ideal candidate should be an expert and enthusiastic in Android platform stack from software system down to hardware...

Cisco
San Jose, California

As a Senior Software Engineer (Automation Tools) on the Platform Engineering Diagnostics team, you will be focused on tool development and involved in the website development from front-end to back-end. Supporting the QA infrastructure update or enhancement using Python or Tcl/expect. Maintain exist...

NVIDIA
Santa Clara, California
Remote

We are looking for expert engineers to come and help design rack level solutions for next generation scaling AI supercomputing platforms. We are looking for a strong technical platform software engineer focused on PCIe firmware, you will own PCIe stack for all NVIDIA GPU servers from firmware and so...