Talent.com
Senior HPC Cluster Engineer
Senior HPC Cluster EngineerVirtualVocations • Fremont, California, United States
Senior HPC Cluster Engineer

Senior HPC Cluster Engineer

VirtualVocations • Fremont, California, United States
job_description.job_card.variable_hours_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

A company is looking for a Senior AI and ML HPC Cluster Engineer.

Key Responsibilities

Provide leadership and strategic guidance on managing large-scale HPC systems, including deployment of compute, networking, and storage

Develop and enhance the ecosystem around GPU-accelerated computing, including scalable automation solutions

Build and maintain AI and ML heterogeneous clusters both on-premises and in the cloud

Required Qualifications

Bachelor's degree in Computer Science, Electrical Engineering, or related field, or equivalent experience

Minimum 5+ years of experience designing and operating large-scale compute infrastructure

Experience with AI / HPC advanced job schedulers, such as Slurm, K8s, PBS, RTDA, or LSF

Proficient in administering Centos / RHEL and / or Ubuntu Linux distributions

Solid understanding of cluster configuration management tools such as Ansible, Puppet, or Salt

serp_jobs.job_alerts.create_a_job

Senior Hpc Engineer • Fremont, California, United States

Job_description.internal_linking.related_jobs
Senior Building Engineer (Nvidia)

Senior Building Engineer (Nvidia)

CBRE Group • Santa Clara, CA, US
serp_jobs.job_card.full_time
Senior Building Engineer (Nvidia).CBRE is looking for Senior Building Engineers who love keeping facilities cool, comfortable, and running smoothly. Sound like you? If you are ready to make a differ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Software Engineer, GPU Infrastructure - HPC

Software Engineer, GPU Infrastructure - HPC

OpenAI • San Francisco, CA, United States
serp_jobs.job_card.full_time
Software Engineer, GPU Infrastructure - HPC.The Fleet team at OpenAI supports the computing environment that powers our cutting‑edge research and product development. We oversee large‑scale systems ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Senior Mechanical Engineer - HVAC Controls Focused

Senior Mechanical Engineer - HVAC Controls Focused

Harley Ellis Devereaux (HED) • San Francisco, CA, United States
serp_jobs.job_card.full_time
Responsible for the design building automation and controls for medium to large, complex projects from schematics through construction administration. TYPICAL DUTIES - PROJECT RELATED.Responsible fo...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Sr. Hardware Engineer, HVAC & Thermal

Sr. Hardware Engineer, HVAC & Thermal

Lucid Motors • Newark, CA, US
serp_jobs.job_card.full_time
Leading the future in luxury electric and mobility.At Lucid, we set out to introduce the most captivating, luxury electric vehicles that elevate the human experience and transcend the perceived lim...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Senior Firmware Engineer

Senior Firmware Engineer

Gridware • San Francisco, CA, United States
serp_jobs.job_card.full_time
Gridware is a San Francisco-based technology company dedicated to protecting and enhancing the electrical grid.We pioneered a groundbreaking new class of grid management called active grid response...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Senior AV Design Engineer - Workplace

Senior AV Design Engineer - Workplace

Diversified • Santa Clara, CA, United States
serp_jobs.job_card.full_time
Diversified is a global leader in audio visual and media innovation, recognized for designing and building the world’s most experiential environments. Our award-winning team specializes in deliverin...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
AV Field Engineer

AV Field Engineer

Stanford University • Stanford, CA, US
serp_jobs.job_card.full_time
Stanford University is expanding its Audio / Video (A / V) services, and we are looking for an AV Field Engineer who can excel in orchestrating and prioritizing timely deployment of multiple large A / V ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Sr. Adhesives Engineer, HV Battery Enclosures

Sr. Adhesives Engineer, HV Battery Enclosures

Lucid Motors • Newark, CA, US
serp_jobs.job_card.full_time
Adhesives Engineer, HV Battery Enclosures.Leading the future in luxury electric and mobility.At Lucid, we set out to introduce the most captivating, luxury electric vehicles that elevate the human ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Simulation Environments Engineer

Simulation Environments Engineer

OpenAI • San Francisco, CA, United States
serp_jobs.job_card.full_time
Simulation Environments Engineer | OpenAI - Careers.Simulation Environments Engineer.Our Robotics team is focused on unlocking general-purpose robotics and pushing towards AGI-level intelligence in...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Emulation Engineer

Emulation Engineer

OpenAI • San Francisco, CA, United States
serp_jobs.job_card.full_time
OpenAI’s Hardware organization develops silicon and system-level solutions designed for the unique demands of advanced AI workloads. The team is responsible for building the next generation of AI-na...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Senior Firmware Engineer

Senior Firmware Engineer

Gridware Technologies Inc. • San Francisco, CA, United States
serp_jobs.job_card.full_time
Gridware is a San Francisco-based technology company dedicated to protecting and enhancing the electrical grid.We pioneered a groundbreaking new class of grid management called active grid response...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Engineer IV - AHP

Engineer IV - AHP

Insight Global • Oakland, CA, US
serp_jobs.job_card.full_time
IG is looking for an Engineer to join the Asset Health Performance team in support of EFD sensors.Currently have line sensor on the distribution systems side, looking at line sensor alerts that are...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Co-Op : Tools & Fixtures System Engineer

Co-Op : Tools & Fixtures System Engineer

El Camino Health • San Francisco, CA, United States
serp_jobs.job_card.full_time
At iRhythm, you’ll have the opportunity to grow your skills and your career while impacting the lives of people around the world. Rhythm is shaping a future where everyone, everywhere can access the...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
HPC Engineer

HPC Engineer

AMAX • Fremont, CA, US
serp_jobs.job_card.full_time
serp_jobs.filters_job_card.quick_apply
We are seeking a highly skilled and motivated HPC Engineer to join our Engineering team.This individual will design, implement, optimize, and support high-performance computing solutions tailored t...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30
Sr. Interdisciplinary Systems Engineer - EV Charging, GFP Electrification & Infrastructure

Sr. Interdisciplinary Systems Engineer - EV Charging, GFP Electrification & Infrastructure

Amazon • San Francisco, CA, United States
serp_jobs.job_card.full_time
Interdisciplinary Systems Engineer - EV Charging, GFP Electrification & Infrastructure.Join our mission to revolutionize electric vehicle charging infrastructure and accelerate Amazon's commitment ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Senior Firmware Engineer – GPU Networking

Senior Firmware Engineer – GPU Networking

NVIDIA Corporation • Santa Clara, CA, United States
serp_jobs.job_card.full_time
Senior Firmware Engineer – GPU Networking page is loaded## Senior Firmware Engineer – GPU Networkinglocations : US, CA, Santa Claratime type : Full timeposted on : Posted Todayjob requisition id...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Lead HPC Infrastructure Engineer

Lead HPC Infrastructure Engineer

Referrals Only • San Francisco, CA, United States
serp_jobs.job_card.full_time
We are seeking a highly accomplished engineer to take ownership of the operations and optimization of next-generation NVIDIA GB200 and GB300 GPU clusters. This role sits at the intersection of high-...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Senior GNC Engineer - Controls and Simulation

Senior GNC Engineer - Controls and Simulation

E-Space • Saratoga, CA, US
serp_jobs.job_card.full_time
Ready to make connectivity from space universally accessible, secure and actionable? Then you've come to the right place!. E-Space is bridging Earth and space to enable hyper-scaled deployments of I...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Silicon Implementation Engineer - Custom Circuits

Silicon Implementation Engineer - Custom Circuits

OpenAI • San Francisco, CA, United States
serp_jobs.job_card.full_time
OpenAI’s Hardware organization develops silicon and system-level solutions designed for the unique demands of advanced AI workloads. The team builds the next generation of AI-native silicon while co...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Senior Production Engineer, Compute

Senior Production Engineer, Compute

Crusoe Energy Systems LLC • San Francisco, CA, United States
serp_jobs.job_card.full_time
At Crusoe, we are building the most sustainable, AI-first cloud infrastructure, and our Compute-focused Site Reliability Engineers are the backbone of that mission. This role is centered on supporti...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted