The Argonne Leadership Computing Facility (ALCF) division within our Computing and Environmental Life Sciences (CELS) directorate is seeking an HPC Systems Administration Specialist to join their team.
ALCF houses one of the world’s first exascale supercomputers named Aurora, its high computing speed and artificial intelligence capabilities will enable science that is impossible today.
Aurora will be capable of delivering over 2 exaflops of computing power, or 2 billion-billion calculations per second.
As an HPC System Administration Specialist, you can expect to :
- Have researchers rely on your hardware and software expertise thereby contributing to research outcomes.
- Design, implement, and manage world-class ALCF supercomputers, such as Aurora, with attendant supporting software and infrastructure, for use by open-science researchers.
- Integrate, support, and document a diverse array of hardware and software at world-class scale.
- Expertly support systems in a complex environment and work efficiently with other operations groups.
Position Requirements
Required qualifications and skills :
- Interest in HPC simulation, emerging trends, and workflows, and be able to work directly with other systems administrators to ensure the continued operations, reliability, and sustainability of ALCF systems
- Advanced knowledge of the Linux operating system
- Knowledge of job-resource managers
- Programming languages (e.g., Python, Go)
- IB and Ethernet based networks
- Compute interconnect principles
- Configuration management tools such as Ansible or similar
- Experience with version control platforms
- To perform the essential functions of this position successful applicants must provide proof of U.S. citizenship and reside in the U.
S., which is required to comply with federal regulations and contract.
Preferred qualifications and skills :
- HPC operations and user workflows
- Software packaging, building software from source, and dynamic linking (e.g., RPM or Spack)
- Software build tools (e.g., CMake, Make, or Autotools)
- Compute virtualization stacks (e.g., Docker or Singularity)
- Lustre parallel file systems
- Data movement (e.g., Globus)
- HPE / Cray HPC system administration
This position can be hired at one of two levels, and the requirements for each are as follows :
- PT2 : Bachelor's degree and 2+ years of experience, or equivalent
- PT3 : Bachelor's degree and 4+ years of experience, or equivalent
Job Family
Professional Technical (PT)
Job Profile
Systems Integration Admin / Support 2
Worker Type
Regular
Time Type
Full time