Search jobs > Phoenix, AZ > System administrator

HPC System Administrator

General Dynamics Information Technology
Phoenix, Arizona, United States of America
Full-time

At GDIT, people are our differentiator. Our work depends on an On Site HPC Systems Admin joining our team to support the National Oceanic and Atmospheric Administration (NOAA), Weather and Climate Operational Supercomputer System (WCOSS).

This position is on-site at a datacenter in the Phoenix, AZ.

WCOSS provides NOAA the operational High Performance Computing (HPC) resources essential to process sophisticated numerical models used to predict and understand atmospheric and oceanic phenomena for weather and climate operational use.

Operating 24 / 7, the next 10-year WCOSS program will deliver significant computational capability that will evolve over time to keep pace with NOAA’s growing environmental modeling needs.

We are looking for individuals to join GDIT’s team to deploy, operate and support leading-edge technology for WCOSS. Specific technology training will be provided.

CANDIDATES MUST HAVE AN ACTIVE PUBLIC TRUST CLEARANCE OR ABOVE TO BE CONSIDERED.

We think. We act. We deliver. There is no challenge we can’t turn into opportunity.

In this role, a typical day will include :

  • Applying current HPC systems administrative skills; desire to learn and deploy new technologies.
  • Developing and deploying monitoring capabilities.
  • Developing and implementing tools for cluster administration.
  • Providing technical support with team of HPC System & Storage Administrators to resolve operational issues.
  • Providing off-hour on-call support on a rotating basis.
  • Managing, planning, and reporting for on-site vendor / subcontractor activities.
  • Working on site at a Manassas data center
  • Managing on-site office and access for vendors and subcontractors
  • Contributing to planning for software and hardware upgrades along with future installations

REQUIRED QUALIFICATIONS

  • Bachelor’s degree or equivalent and 10+ years of experience with HPC systems operations.
  • Experience working in a 24X7 operational environment.

DESIRED QUALIFICATIONS

  • Demonstrated experience to deploying and managing large-scale HPC systems using OS provisioning tools (e.g., xCat, HPCM, Bright).
  • Demonstrated experience using configuration management tools (e.g., Ansible, Puppet).
  • Linux system administration experience (e.g., SLES, RedHat or CentOS).
  • Batch management / scheduling experience, PBSpro preferred.
  • Parallel filesystem configuration and monitoring experience (e.g., Lustre, NFS).
  • Network interconnect configuration and monitoring experience (e.g., Infiniband, Ethernet).
  • Programming or scripting in at least two languages (e.g., Bash, Perl, Python, C).
  • Strong writing skills for technical documents, system procedures, user wiki’s and FAQs.
  • Ability to work both independently and as part of a team.
  • Knowledge / experience with managing subcontractors or vendors under Service Level Agreements (SLAs)
  • Knowledge of computer system power and cooling (air and liquid cooling)
  • Experience managing, maintaining and repairing HPC and server hardware
  • 30+ days ago
Related jobs
General Dynamics Information Technology
Phoenix, Arizona

Our work depends on an On Site HPC Systems Admin joining our team to support the National Oceanic and Atmospheric Administration (NOAA), Weather and Climate Operational Supercomputer System (WCOSS). Providing technical support with team of HPC System & Storage Administrators to resolve operational i...

Promoted
ManTech
Chandler, Arizona

L2 and L3 network equipment (routers, switches,. Monitor network performance to. Follow organizational IT security procedures for network setup, installing firewalls, VPN, IDS/IPS, etc. Understanding of Network Protocols. ...

Promoted
GeoLogics Corporation
Scottsdale, Arizona

As a Senior Principal Systems Engineer, you'll participate in requirements analysis and management, functional analysis, performance analysis, system design, trade studies, systems integration and test (verification) in the development and evaluation of networks and information systems. Requires a B...

Promoted
General Dynamics Mission Systems
Scottsdale, Arizona

Requires a Bachelor's degree in Software Engineering, or a related Science, Engineering or Mathematics field. Apply the appropriate standards, processes, procedures, and tools throughout the system development life cycle to support the generation of technical engineering products. Our engineers rede...

Promoted
VirtualVocations
Tempe, Arizona

A company is looking for a Senior Linux Systems Engineer. ...

Promoted
Wipro
Chandler, Arizona

RedHat Linux and Shell scripting is important. Requires related experience in the UNIX systems and shell scripting, design, maintenance, and administration of NOSQL databases (Redis, Memsql, CockroachDB, MongoDb, Cassandra, etc. ...

Promoted
KPG99 INC
Phoenix, Arizona

...

Promoted
Blue Origin
Phoenix, Arizona

We're working to develop reusable, safe, and low-cost space vehicles and systems within a culture of safety, collaboration, and inclusion. As part of a hardworking team of diverse analysts, you will conduct thermal and fluids analysis to define the performance and operations for various spaceflight ...

Promoted
Viasat
Tempe, Arizona

Do you thrive in a fast-paced environment where you can make a difference? If so, come join our Resilient Space Missions team as a Systems Engineer. You will work with interdisciplinary teams including Business Development, Systems, Software, RF circuits, hardware engineers, and others to. The produ...

St. Mary's Food Bank
Phoenix, Arizona

The Systems Administrator will oversee the accessibility and functioning of IT systems, diagnose and resolve problems that affect system performance or accessibility to an IT service, and more. Microsoft Certified Systems Administrator (MCSA)) is a plus. Proven experience as a System Administrator, ...