Search jobs > Rockville, MD > Hpc engineer

HPC Systems Engineer

Zachary Piper
Rockville, MD
$120K-$150K a year
Full-time

Zachary Piper Solutions is in need of a HPC Systems Engineer to support the National Institutes of Allergy and Infectious Diseases (NIAID).

The HPC Systems Engineer will support HPC hardware, install scientific applications, and so much more to monitor the health of NIAID's HPC clusters.

This is remote position based out of Rockville, MD. The HPC Systems Engineer will collaborate with clients to define and document IT project scope.

priority, budgets and schedules and oversee implementation of projects.

Responsibilities Include :

  • Work with a 4000+ core HPC cluster that is GPU-focused and a 1,500+ HPC cluster supporting the hardware and operating system environments
  • Supporting bioinformatics applications for a large and diverse research community with needs in genomics, cryo-electron microscopy, and AI / ML
  • Monitor the portfolio of software applications and be proactive in planning upgrades and license renewals
  • Monitor and report on cluster performance and generate data to show usage and trends
  • Triage support requests from the research community and work with others in the Scientific Infrastructure team to resolve issues and complete service requests
  • Collaborate with researchers to guide them in effective use of the HPC resources, such as job scheduler submission, data formats, and building data workflows
  • Engage with researchers to understand their HPC needs to include data life cycle management, integration of scientific instruments to HPC, and storage capacity and compute requirements

Requirements Include :

  • Bachelors Degree in Information Technology or alike field
  • Minimum of 5 years of experience with servers, datacenters, networking, and related technologies
  • Minimum of 5 years of experience managing Linux systems
  • Experience with Spack package manager, including making packages from PyPi, R, Github
  • Experience installing and packaging GPU applications and optimizing job submission scripts that are used for ML model training, data mining operations, or high-res graphics rendering
  • Experience with Python scripting, Git, Ansible, and Terraform
  • Ability to obtain a NIH Public Trust

Compensation Includes :

  • $120,000 - 150,000 *depending on experience*
  • Health, Dental, Vision, 401K, PTO, Paid Holidays, etc.

LI-CB1

LI-REMOTE

Keywords : Systems Engineer, HPC, High performance compute, cluster, HPC cluster, GPU, GPU focused, core, hardware, operating system, HPC Engineer, python, scripting, python scripting, git, git workflows, git distributed workflows, ansible manage system configuration, ansible, terraform, system, cluster performance, scheduler, schedule, job schedule

9 days ago
Related jobs
Promoted
Zachary Piper
Rockville, Maryland

Keywords: Systems Engineer, HPC, High performance compute, cluster, HPC cluster, GPU, GPU focused, core, hardware, operating system, HPC Engineer, python, scripting, python scripting, git, git workflows, git distributed workflows, ansible manage system configuration, ansible, terraform, system, clus...

Zachary Piper Solutions
Rockville, Maryland

The HPC Systems Engineer will support HPC hardware, install scientific applications, and so much more to monitor the health of NIAID's HPC clusters. The HPC Systems Engineer will collaborate with clients to define and document IT project scope. Work with a 4000+ core HPC cluster that is GPU-focused ...

Promoted
KBR
Fulton, Maryland

Senior Data Center Engineer Infrastructure/icloud – TS/SCI. Infrastructure/Cloud Engineering: VMware vCloud Enterprise Suite; hybrid cloud with multiple classified Government clouds. KBR’s National Security Solutions team provides high-end engineering and advanced technology solutions to our custome...

Promoted
Absolute Business Solutions Corp (ABSC)
Silver Spring, Maryland

ABSC is seeking a System Administrator/Machine Learning Operations (MLOps) supporting DIA-NMEC under the DOMEX Data Discovery Platform (D3P) Modernization program which falls under our 10 year DOMEX Technology Platform (DTP) contract. Bring your mix of intellectual curiosity, quantitative acumen, an...

Promoted
DISH Network
Potomac, Maryland

As the team grows, lead and mentor junior electrical engineers, providing guidance and expertise in electrical engineering principles and aviation-specific knowledge. Bachelor's degree Electrical Engineering, Aviation, Computer Engineering or a related technical field. Master's degree Electrical Eng...

Promoted
Realm One
Central Maryland, Maryland

Bachelor’s degree in System Engineering, Computer Science, Information Systems, Engineering Science, Engineering Management, or related discipline. Enterprise IT contract with a team of 60+ engineers responsible for the architecture, engineering, integration, operations, maintenance, and sustainment...

Promoted
Skyward IT Solutions
Rockville, Maryland

We are committed to creating an inclusive and equitable environment where everyone, regardless of gender, race, ethnicity, sexual orientation, disability, or background, can thrive. Are you someone who is the "go-to IT person" in your group of friends or at your workplace? You know, the person who a...

Promoted
Office of The Chief Financial Officer
Maryland, MD, United States

Experience building an automated cloud infrastructure across Prod and Non-Prod environments. Experience with Configuration management and Infrastructure as Code (IaC) toolsets. ...

Promoted
DMI (Digital Management, Inc.)
Bethesda, Maryland

We are seeking a Lead DevOps Engineer with experience in design, development, and implementation of projects in the cloud. Cloud Architect Associate, DevOps Engineer, Developer). Lead a small team of engineers and distribute tasks to support the project. Bachelor's Degree in business, information te...

Promoted
ALTA IT Services
Bethesda, Maryland

Bethesda, MD – hybrid (2 days/week onsite). US citizenship is required per government contract. ...