Talent.com
Staff Software Engineer, Slurm


Crusoe Energy Systems LLC
San Francisco, CA, United States
Job type: Full-time

Crusoe's mission is to accelerate the abundance of energy and intelligence. We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability.

Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.

About the Role:

We are actively seeking an exceptional Staff Software Engineer to join our cloud software team, focusing specifically on building and operating Slurm as a fully managed cloud service within Crusoe Cloud. This role is crucial for delivering next-generation orchestration capabilities to power GPU-accelerated and high-performance computing (HPC) at scale.

Your expertise will be instrumental in designing and scaling our carbon-reducing operating model, and advancing our AI training clusters to lead the industry in reliability and performance. You will shape the technical direction of systems that allow customers to run advanced workloads across CPUs, NVIDIA and AMD GPUs, and high-performance networking environments.

You will be involved in writing and reviewing code, contributing to proposals, and drafting architecture documents. You will evaluate tools and frameworks, considering their impact on reliability, scalability, operational costs, and ease of adoption.

What You'll Be Working On:

Lead the development and engineering of our managed Slurm offering, providing a seamless experience for AI/ML and HPC customers who rely on robust Slurm job scheduling.

Contribute to the development of scalable and robust software solutions, closely aligning with the strategic objectives outlined in the Crusoe Cloud roadmap.

Design, build, and maintain Kubernetes operators and controllers dedicated to managing the lifecycle, configuration, and state of large-scale Slurm clusters.

Drive the integration of GPU acceleration in the Slurm environment, including device plugin architecture, GPU operators, accelerator-aware scheduling, and resource allocation.

Ensure that high-performance networking technologies, such as InfiniBand and RoCE, are correctly leveraged for distributed GPU workloads running through Slurm.

Implement and manage features such as multi-tenancy, cluster lifecycle management, auto-scaling, and high availability for the managed Slurm control plane services.

Develop scalable systems to compete with leading managed services.

Support the development of your peers by sharing knowledge and providing guidance in technical discussions.

What You'll Bring to the Team:

You have 7+ years of software engineering experience, with a strong background in systems engineering. Experience in distributed systems, cloud, or HPC environments is a must.

You possess 2+ years of programming experience in Go. Strong proficiency in other systems languages (Rust, C++, Python for HPC tooling) is also beneficial.

You have extensive experience with Kubernetes and with Linux engineering and debugging.

You possess deep knowledge of Slurm (Simple Linux Utility for Resource Management) administration and the architecture required for managing compute jobs in high-performance environments.

You are skilled in infrastructure as code and familiar with systems-level challenges, ideally with experience using Terraform.

You understand Argo, CI/CD, and automated testing pipelines. You can design and take ownership of system architecture, including CI/CD pipelines, while ensuring adherence to security standards.

You have strong knowledge of container networking (CNI plugins, service meshes) and Linux networking fundamentals.

You are familiar with GPU integration in Kubernetes, including device plugins and GPU operators.

You have excellent communication skills, both verbal and written.

Compensation Range

Compensation will be paid in the range of $185,000 - $224,000. Restricted Stock Units are included in all offers. Compensation will be determined by the applicant's knowledge, education, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
