Talent.com
HPC Engineer

HPC Engineer

The Aerospace CorporationChantilly, VA, United States
job_description.job_card.variable_hours_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

The Aerospace Corporation is the trusted partner to the nation's space programs, solving the hardest problems and providing unmatched technical expertise. As the operator of a federally funded research and development center (FFRDC), we are broadly engaged across all aspects of space- delivering innovative solutions that span satellite, launch, ground, and cyber systems for defense, civil and commercial customers. When you join our team, you'll be part of a special collection of problem solvers, thought leaders, and innovators. Join us and take your place in space.

Job Summary

The Aerospace Corporation is seeking a talented and motivated High-Performance Computing (HPC) Engineer (Site Reliability Engineer Staff III / IV) to join our Computational Services team. In this role, you will be responsible for developing, implementing, and optimizing HPC clusters that support both on-premises and cloud environments. You will work alongside rocket scientists and engineers, tackling complex space enterprise challenges and contributing directly to the success of critical national space assets. We value a collaborative, proactive mindset and a shared commitment to engineering excellence.

Work Model

This is a full-time position located in either Chantilly, VA, or El Segundo, CA, with an expectation of 100% onsite work.

What You'll Be Doing

  • Design and implement HPC solutions that optimize resource utilization across diverse workloads in both classified and unclassified settings.
  • Manage a 10,000-core classified cluster and a 3,000-core unclassified cluster to ensure peak performance.
  • Deliver high-quality HPC infrastructure design, automated provisioning, and system configuration.
  • Develop and deploy automation solutions using tools such as Ansible or Puppet.
  • Optimize AI workloads and GPU computing performance.
  • Monitor, analyze, and tune HPC system performance, utilization, and resource allocation to maintain operational efficiency.
  • Work closely with scientists and engineers to support new and ongoing projects, and mission technical analysis supporting national space assets.
  • Develop cost-efficient on-premise and cloud HPC service offerings that align with mission and business objectives.
  • Implement and enforce security best practices that comply with government regulations across both classified and unclassified environments.
  • Harden Linux systems to meet stringent security requirements

What You Need to be Successful

Minimum Requirements for the Site Reliability Engineer Staff III :

  • Bachelor's degree in Computer Science, Engineering, or equivalent experience.
  • Minimum of 5 years' experience in Linux system administration within an enterprise HPC environment.
  • In-depth knowledge of Linux, networking, and HPC systems.
  • Proven experience in managing the Slurm scheduler and setting up HPC systems for both interactive and batch workloads.
  • Proficiency in scripting and competence with automation tools such as Ansible or Puppet.
  • Experience hardening Linux systems to meet security requirements
  • Experience with hardware and infrastructure automation in environments using server vendors such as HPE or Cisco.
  • Strong communication skills, with an ability to work both independently and as part of a geographically distributed team.
  • CompTIA Security+ CE certification or equivalent that meets DoD 8570.01-m requirements for IAT Level II personnel
  • Active TS / SCI clearance. U.S citizenship is required to obtain security clearance.
  • In addition to the above, the minimum requirements for the Site Reliability Engineer Staff IV include :

  • 7+ years of experience in an enterprise HPC environment.
  • Experience performing in-place upgrades of Slurm.
  • Experience provisioning and supporting AI & NVIDIA GPU technologies
  • Skill in provisioning and supporting AI & NVIDIA GPU technologies, with expertise in GPU integration, resource allocation, and scheduling using Slurm.
  • Hands-on background with cloud HPC services
  • Experience supporting a wide range of technical software (compilers, mod&sim tools, languages, COTS, GOTs) including the development of environment modules.
  • Demonstrated ability to lead cross-functional teams and mentor junior engineers.
  • How You Can Stand Out

    It would be impressive if you have one or more of these :

  • Experience with AWS Parallel Computing Service (AWS ParallelCluster).
  • Knowledge of NVLINK or NVSWITCH for optimizing GPU workflows.
  • Familiarity with Prometheus and Grafana for monitoring and performance visualization.
  • Expertise in optimizing and customizing Slurm partitions (queues) to balance utilization and reduce job wait times.
  • Experience provisioning or supporting Slurm REST API
  • Background in containerization within an HPC context used for data processing and technical analysis
  • Proficiency with automation tools such as Ansible or Puppet for HPC
  • Experience developing solutions that optimize data storage
  • Experience managing parallel file systems such as Lustre.
  • An active IC TS / SCI clearance with CI Polygraph.
  • We offer a competitive compensation package where you'll be rewarded based on your performance and recognized for the value you bring to our business. The grade-based pay range for this job is listed below. Individual salaries within that range are determined through a wide variety of factors including but not limited to education, experience, knowledge and skills.

    (Min - Max)

    $135,200 - $220,000

    Pay Basis : Annual

    Leadership Competencies

    Our leadership philosophy is simple : every employee, regardless of level and role, can demonstrate leadership. At Aerospace, our commitment is our people. To cultivate our talent and ensure that we have a strong pipeline of future leaders, we want individuals who :

  • Operate Strategically
  • Lead Change
  • Engage with Impact
  • Foster Innovation
  • Deliver Results
  • Ways We Reward Our Employees

    During your interview process, our team will provide details of our industry-leading benefits.

    Benefits vary and are applicable based on Job Type. A few highlights include :

    Comprehensive health care and wellness plans

    Paid holidays, sick time, and vacation

    Standard and alternate work schedules, including telework options

    401(k) Plan - Employees receive a total company-paid benefit of 8%, 10%, or 12% of eligible compensation based on years of service and matching contributions; employees are immediately eligible and vested in the plan upon hire

    Flexible spending accounts

    Variable pay program for exceptional contributions

    Relocation assistance

    Professional growth and development programs to help advance your career

    Education assistance programs

    An inclusive work environment built on teamwork, flexibility, and respect

    We are all unique, from various backgrounds and all walks of life, yet one thing bonds all of us to each other-the belief that we can make a difference. This core belief empowers us to do our best work at The Aerospace Corporation.

    Equal Opportunity Commitment

    The Aerospace Corporation is an equalopportunity employer. All qualified applicants will receive consideration for employment and will not be discriminated against on the basis of race, age, sex (including pregnancy, childbirth, and related medical conditions), sexual orientation, gender, gender identity or expression, color,religion,geneticinformation, marital status, ancestry, national origin, protected veteran status, physical disability, medical condition, mental disability, or disability status and any other characteristic protected by state or federal law. If you're an individual with a disability or a disabled veteran who needs assistance using our online job search and application tools or need reasonable accommodation to complete the job application process, please contact us by phone at 310.336.5432 or by emailat peoplemangmnt.mailbox@aero.org .You can also review Know Your Rights : Workplace Discrimination is Illegal .

    serp_jobs.job_alerts.create_a_job

    Hpc Engineer • Chantilly, VA, United States

    Job_description.internal_linking.related_jobs
    • serp_jobs.job_card.promoted
    HPC Engineer

    HPC Engineer

    The Swift GroupLaurel, MD, US
    serp_jobs.job_card.full_time
    As a Senior Systems Engineer at OPS consulting, you will provide portfolio level advisory support to the customer, facilitating the development, acquisition and support of complex systems.You will ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    • serp_jobs.job_card.new
    Principal System Engineer - Mission Planning / Flight Dynamics Lead

    Principal System Engineer - Mission Planning / Flight Dynamics Lead

    Iridium Satellite LLCReston, VA, United States
    serp_jobs.job_card.full_time
    Principal System Engineer - Mission Planning / Flight Dynamics Lead.Iridium is an award-winning and innovative satellite communications company with bragging rights to the only network that offers ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
    • serp_jobs.job_card.promoted
    HVAC Helper

    HVAC Helper

    LPR A / C & Heating, Inc.Nokesville, VA, US
    serp_jobs.job_card.full_time
    LPR AC and heating have immediate open positions for an HVAC helper pay rate starts from $18-$20.Small Family based company, est. Small Family based company, est.serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    • serp_jobs.job_card.new
    Sr. System Administrator

    Sr. System Administrator

    Foxhole TechnologyLeesburg, VA, United States
    serp_jobs.job_card.full_time
    Discover an exciting career at Foxhole Technology, an innovative IT Engineering firm founded in 2007.As leaders in cybersecurity, DEVSEC OPS, Agile Development, Cloud and IT support for federal civ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
    • serp_jobs.job_card.promoted
    Onsite Service Engineer •PC 1487

    Onsite Service Engineer •PC 1487

    Miltenyi Biotec IncGaithersburg, MD, United States
    serp_jobs.job_card.full_time
    Miltenyi Biotec is seeking a highly skilled Onsite Service Engineer to serve as the dedicated on-site at customer facility. In this customer-facing role, you will act as Miltenyi's primary technica...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Structural Design Engineer - Jacksonville - Remote

    Structural Design Engineer - Jacksonville - Remote

    Canam Steel CorporationPoint Of Rocks, MD, United States
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    Design and check building components steel joists and metal deck in accordance with contract documents to meet industry codes, manufacturing efficiencies, and shop schedules while being consi...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    • serp_jobs.job_card.new
    Lead Security Engineer

    Lead Security Engineer

    Foxhole TechnologyLeesburg, VA, United States
    serp_jobs.job_card.full_time
    Job Title : Lead Security Engineer.Location : Leesburg, VA -Hybrid (Onsite 3 days per week).Foxhole Technology provides robust cybersecurity and IT support capabilities for federal civilian and defe...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
    • serp_jobs.job_card.promoted
    Control Systems Engineer

    Control Systems Engineer

    Technical SourceManassas, VA, US
    serp_jobs.job_card.full_time
    Technical Source is currently in search of.The qualified candidate will be responsible for the design, commissioning, start up, and troubleshooting of Rockwell / Allen Bradley Building Automation Sys...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Maintenance Operations Crew Chief - Waste Management

    Maintenance Operations Crew Chief - Waste Management

    Loudoun County GovernmentLeesburg, VA, United States
    serp_jobs.job_card.full_time
    Loudoun County Government has been named one of Forbes' 2025 Best Large Employers!.We're proud to be recognized nationally for our commitment to employee satisfaction and excellence in public servi...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    • serp_jobs.job_card.new
    HPC Data Storage Engineer

    HPC Data Storage Engineer

    The Aerospace CorporationChantilly, VA, United States
    serp_jobs.job_card.full_time
    The Aerospace Corporation is the trusted partner to the nation's space programs, solving the hardest problems and providing unmatched technical expertise. As the operator of a federally funded resea...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
    • serp_jobs.job_card.promoted
    • serp_jobs.job_card.new
    Controls & Integration Engineer IV

    Controls & Integration Engineer IV

    CPGAshburn, VA, United States
    serp_jobs.job_card.full_time
    Controls & Integration Engineer IV .Ashburn, VA .CONTROLS & INTEGRATION ENGINEER IV.Contr...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
    • serp_jobs.job_card.promoted
    Director of Procurement - Point of Rocks, MD

    Director of Procurement - Point of Rocks, MD

    Canam Steel CorporationPoint Of Rocks, MD, United States
    serp_jobs.job_card.full_time
    Point of Rocks, Maryland, United States.At Canam Steel Corporation, the Director of Procurement plays a key strategic and operational role in managing the sourcing, procurement, and supply chain fu...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Full Stack Software Engineer SME

    Full Stack Software Engineer SME

    LeidosAldie, VA, US
    serp_jobs.job_card.full_time
    National Security Sector combines technology-enabled services and mission software capabilities in the areas of cyber, logistics, security operations, and decision analytics to support our defense ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Onsite Service Engineer •PC 1518

    Onsite Service Engineer •PC 1518

    Miltenyi Biotec IncFrederick, MD, United States
    serp_jobs.job_card.full_time
    Miltenyi Biotec is seeking a highly skilled Onsite Service Engineer to serve as the dedicated on-site at customer facility. In this customer-facing role, you will act as Miltenyi's primary technica...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Commercial Hvac Technician

    Commercial Hvac Technician

    Control Tec Mechanical LLCFrederick, MD, US
    serp_jobs.job_card.full_time
    We are looking for an experienced and dependable.In this role, you will be responsible for the.The ideal candidate will have solid technical skills, strong problem-solving abilities, and a commitme...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Mechanical Engineer (Engineer IV)

    Mechanical Engineer (Engineer IV)

    Fairfax County GovernmentFairfax, VA, United States
    serp_jobs.job_card.full_time
    This position includes a sign-on bonus of $5,000 for new county hires.This position provides senior level leadership and is responsible for the overall development and management of large scale and...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Hiring our Heroes Skillbridge - Systems Engineer

    Hiring our Heroes Skillbridge - Systems Engineer

    SYSTEMS PLANNING AND ANALYSIS, INC.Alexandria, VA, US
    serp_jobs.job_card.full_time
    Systems Planning and Analysis, Inc.SPA) delivers high-impact, technical solutions to complex national security issues.With over 50 years of business expertise and consistent growth, we are known fo...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Biomedical Engineer 2

    Biomedical Engineer 2

    Inova Health SystemLeesburg, VA, United States
    serp_jobs.job_card.full_time
    Inova Loudoun Hospital Clinical Engineering is looking for a dedicated Biomedical Engineer 2 to join the Team.This will be full-time working Monday - Friday day shift located in Leesburg, VA.The Bi...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days