Search jobs > Santa Clara, CA > Remote > Senior data architect

Senior Software Architect - Data Center Systems

NVIDIA
Santa Clara, CA, US
Remote
Full-time

We are building innovative server systems for GPU accelerated applications, such as Deep Learning. Data Center SW team architects and develops the end to end software and firmware stack for these systems.

We are looking for a Senior Software Architect who has deep expertise in designing server platforms and has added understanding of application use cases in Deep Learning workloads.

You will work with world class engineering teams, product management, Operations and Customer support to build systems that will truly delight our customers.

What you’ll be doing :

You will lead software activities for NVIDIA's deep learning server platforms, from design through production; collaborating with teams across company to deliver software solutions

Drive the system architecture for a complex server platform in a multi-functional environment.

Partner across application software, libraries, system software and firmware teams to design complete software solutions for new server platforms

Work directly with major customers to understand their requirements and work to align their roadmap with NVIDIA’s roadmap.

Work with business partners and vendors to shape their products to meet NVIDIA’s needs.

Develop a roadmap of new technologies and protocols and drive their design and adoption.

Mentor architects and engineering teams to grow them into future leaders.

Make key technical decisions for designs involving complex inter-component dependencies.

What we need to see :

Deep experience in designing architecture for scalable and performant server systems, particularly at the SW / HW interface.

Understanding of HPC or Deep learning workloads and use of accelerated computing platforms.

Expertise in Out of Band and In-band management architectures.

Knowledge of server system architecture and implications of architecture decisions on overall performance of end applications.

Demonstrable experience in implementing left shift strategy to de-risk program execution.

Excellent written and verbal communication skills.

BS or MS degree in Computer Engineering, Computer Science, or related degree or equivalent experience.

10+ years in the area of System architecture and design.

Ways to stand out from the crowd :

Knowledge of cloud and cluster level deployment and management systems.

Strong background of device management protocols such as Redfish, IPMI, MCTP, PLDM and RDE.

Knowledge in storage and networking technologies.

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization.

The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services.

Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.

NVIDIA is looking for great people like you to help us accelerate the next wave of artificial intelligence.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us.

Are you creative and autonomous? Do you love a challenge? If so, we want to hear from you. Come, join our Data center server systems team and help build the real-time, cost-effective computing platform driving our success in this exciting and quickly growing field.

The base salary range is 220,000 USD - 419,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and . NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

30+ days ago
Related jobs
Promoted
Advanced Micro Devices, Inc
San Jose, California

Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. You will need to drive technical direction for next generation frameworks for AI model training and inference for...

Promoted
TikTok
San Jose, California

The Data Management Suite team is building products that cover the whole lifecycle of data pipeline, including data ingestion and Integration, data development, data catalog, data security and data governance. As a software engineer in the data management suite team, you will have the opportunity to...

Equinix
San Jose, California

Senior Mechanical Engineer, Data Center Cooling Systems. Proven years of professional experience preferred in mechanical engineering program ownership includingaspects of design, operating practices, repair and maintenance, and training focusing on data center or mission-critical systems. Equinix is...

TikTok
San Jose, California

Responsibilities:- Create and execute functionality tests for data products, including UI, server, and big data databases. Proficient in both manual and automated testing, with experience in server-side, database, and data product testing. Track and manage bugs throughout the entire software develop...

Walmart
Sunnyvale, California

The Sponsored Search Data team is responsible for designing, implementing, and maintaining data pipelines, databases and ETL processes for Walmart's Sponsored Search Advertising platform. Strong understanding of backend software engineering, data engineering, and basic data science. Option 1: Bachel...

ByteDance
San Jose, California

About the TeamThe infrastructure team supports the company's fast growth by building and operating hyperscale datacenters, managing the life cycle of server fleet, providing cloud solutions, and developing various infrastructure services and making sure they are scalable and are reliable. ...

Amazon Development Center U.S., Inc.
Palo Alto, California

Knowledge of professional software engineering & best practices for full software development life cycle, including coding standards, software architectures, code reviews, source control management, continuous deployments, testing, and operational excellence. Experience leading the architecture and ...

Databricks
Mountain View, California

As a software engineer on the Runtime team at Databricks, you will be building the next generation distributed data storage and processing systems that can outperform specialized SQL query engines in relational query performance, yet provide the expressiveness and programming abstractions to support...

Amazon Development Center U.S., Inc.
Sunnyvale, California

If you have good experience in C/C++, and a passion for systems software engineering such as kernel or embedded development, then this is a unique opportunity to join us in building the platform which is the basis for all new EC2 VPC features in the years to come. Communicating with users, other tec...

DeepSight Technology
Santa Clara, California

Senior Imaging Systems Software Engineer. As our Senior Imaging Systems Software Engineer, you'll enjoy a competitive salary ranging from. As our Senior Imaging Systems Software Engineer, you'll be instrumental in advancing the quality and interpretation of ultrasound images. You'll collaborate clos...