Talent.com
HPC Support Engineer

HPC Support Engineer

VirtualVocationsGarden Grove, California, United States
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

A company is looking for a Super Intelligence HPC Support Engineer.

Key Responsibilities

Act as the primary technical point of escalation for customers running hyperscale GPU clusters

Lead incident response for complex issues, ensuring rapid triage and timely resolution

Proactively identify risks and drive preventative improvements in large environments

Required Qualifications

7+ years of experience in HPC or cloud support engineering with customer-facing responsibilities

Proven experience managing large-scale Linux clusters and distributed HPC / AI workloads

Deep expertise in orchestration tools such as Kubernetes and / or Slurm

Strong knowledge of GPU technologies (CUDA, NCCL, MIG, NVLink, GPUDirect RDMA)

Skilled in high-throughput networking and cluster storage solutions

serp_jobs.job_alerts.create_a_job

Hpc Engineer • Garden Grove, California, United States