Senior Network Software Engineer

NVIDIA
Santa Clara, CA, US
Remote
Full-time

We are seeking a senior network software engineer to join our team. As a Senior Network Software Engineer, you will be instrumental in co-designing and implementing innovative solutions that power AI applications at an unprecedented scale.

Your expertise in network software architecture and high-performance interconnects will drive innovation and enable us to deliver platforms that redefine what is possible.

This is an exceptional opportunity to push the boundaries of technology, shape the future of AI, and work with a world-class team of like-minded engineers.

What you will be doing:

Collaborate with multi-functional teams to analyze, co-design, and develop networking software and hardware for innovative AI platforms.

Drive the development of new networking algorithms and protocols for point-to-point and collective operations at scale.

Identify bottlenecks and inefficiencies in application code, proposing optimizations to enhance performance and network utilization.

Design and implement performance benchmarks and testing methodologies to evaluate performance at scale.

Provide guidance and recommendations for optimizing AI applications for speed, scalability, and resource efficiency.

Share knowledge with domain expert teams as they develop applications for the next generation of AI platforms.

Contribute to the development of tools and frameworks to facilitate network optimization.

What we need to see:

PhD in Computer Science, Computer Engineering, or related field, or equivalent experience.

10+ years of experience with a focus on high-performance networking and AI applications.

Expertise in RDMA networking (InfiniBand, RoCE), Ethernet, and PCIe.

Experience with at least one high-performance networking library: NCCL, UCX, libfabric, MPI, UCC.

Deep understanding of various aspects of high-performance networking, including network technologies, debugging, and performance analysis.

Experience in developing and optimizing deep learning frameworks such as PyTorch and TensorFlow.

Proficiency in Python and C/C++.

Experience in CUDA programming.

Track record of delivering performance improvements for software used in large-scale deployments.

Knowledge of Kubernetes (k8s) and cloud-native application principles is a plus.

Familiarity with continuous integration and delivery practices for performance optimization.

Ways to stand out from the crowd:

Hands-on experience in optimizing networking building blocks for DL frameworks like PyTorch and TensorFlow.

Experience in developing communication libraries such as NCCL, UCX, UCC, MPI.

In-depth knowledge of RDMA, GPUDirect, and network technologies.

Provide references to your code contributions.

This is an exceptional opportunity to push the limits of state-of-the-art technology and contribute to the development of platforms the world has never seen before.

As part of NVIDIA, you'll work alongside top-tier talent in a collaborative environment that fosters innovation and creativity.

If you're passionate about shaping the future of AI and high-performance computing, apply now to embark on an exciting journey with us!

The base salary range is 220,000 USD - 419,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

30+ days ago