Senior HPC Linux Systems Engineer

Oak Ridge National Laboratory, Oak Ridge, TN, US
Job type: Full-time

Overview

The High-Performance Computing Systems Section within the National Center for Computational Sciences (NCCS) is seeking a Senior HPC Linux Systems Engineer to join the HPC Infrastructure team. The preferred candidate will possess commensurate knowledge, skills and abilities in addition to relevant education, certifications, experience and demonstrated ability to work as a member of a team.

NCCS provides state-of-the-art computational and data science infrastructure coupled with dedicated technical and scientific professionals tackling large-scale problems across a broad range of scientific domains for accelerating scientific discovery and engineering advances. NCCS hosts the Oak Ridge Leadership Computing Facility (OLCF), one of the Department of Energy's (DOE) National User Facilities, which operates Frontier, the nation's first exascale supercomputer.

Major Duties / Responsibilities

Systems Administration:

  • Lead the architecture and deployment of HPC-scale services
  • Create and maintain internal documentation of system architectures, configurations and procedures
  • Serve as the highest tier of support for complex issues, providing quick and efficient resolution
  • Develop, maintain and review high-quality code for internal tools using programming languages such as Python, Golang, or Rust

Virtualization and Automation:

  • Design, deploy and manage resources in the NCCS VMware environment
  • Identify potential automation targets and lead efforts to automate processes
  • Define policies and procedures for automation and configuration management for the team and organization as a whole
Identity Management and Security:

  • Design and administer RSA SecurID and PingFederate servers
  • Deploy, configure and support identity and access management services such as single sign-on (SSO), OAuth, two-factor authentication, zero trust, etc.
Project Management and Leadership:

  • Lead infrastructure projects through all phases from planning to design, implementation and support
  • Mentor and train junior staff, creating training documentation, holding knowledge sharing sessions, and fostering skill growth throughout the team
  • Propose and implement improvements to existing infrastructure systems as well as new systems, processes and procedures
Basic Qualifications

  • Bachelor's degree in computer science or a closely related field and a minimum of 7 years of experience in Linux systems administration, or a Master's degree and a minimum of 4 years of experience in Linux systems administration. An equivalent combination of education and experience will be considered.
Preferred Qualifications

  • Excellent interpersonal and communication skills and the ability to work within a team
  • Strong experience in Identity Management, supporting SSO, OAuth, and two-factor authentication, primarily in PingFederate and RSA SecurID. Entra ID experience is a bonus.
  • Strong working knowledge of Linux system fundamentals and common network protocols
  • Programming and scripting skills in common languages such as Python and bash
  • Understanding of versioning and code review tools like GitHub and GitLab
  • Experience implementing and supporting highly available systems and services
  • Experience with configuration management tools such as Puppet or Ansible
  • Experience deploying and maintaining virtual environments using VMware
  • Experience deploying, maintaining and troubleshooting a variety of infrastructure services such as OpenLDAP, DNS, DHCP, etc.
  • Ability to plan, prioritize and complete assigned projects with minimal supervision
Security, Credentialing, and Eligibility Requirements

    For employment at Oak Ridge National Laboratory (ORNL), a Real ID compliant form of identification will be required. ORNL is subject to Department of Energy (DOE) access restrictions. All employees must be able to obtain and maintain a federal Personal Identity Verification (PIV) card as mandated by Homeland Security Presidential Directive 12 (HSPD-12) and DOE Order 473.1A, which requires a favorable post-employment background investigation. To obtain this credential, new employees must successfully complete and pass a Federal Tier 1 background check investigation. This investigation includes a declaration of illegal drug activities, including use, supply, possession, or manufacture within the last year.

    For foreign national candidates: If you have not resided in the U.S. for three consecutive years, you are not eligible for the PIV credential and will need to obtain a favorable Local Site Specific Only (LSSO) risk determination to maintain employment. Once you meet the three-year residency requirement, you will be required to obtain a PIV credential to maintain employment.

    About ORNL

    As a U.S. Department of Energy (DOE) Office of Science national laboratory, ORNL has an 80-year legacy of addressing the nation's most pressing challenges. Our team is made up of over 7,000 dedicated and innovative individuals. Our goal is to create an environment where a variety of perspectives and backgrounds are valued, ensuring ORNL is known as a top choice for employment. These principles are essential for supporting our broader mission to drive scientific breakthroughs and translate them into solutions for energy, environmental, and security challenges facing the nation.

    ORNL offers competitive pay and benefits programs to attract and retain individuals who demonstrate exceptional work behaviors. The laboratory provides a range of employee benefits, including medical and retirement plans and flexible work hours, to support the well-being of you and your family. Employee amenities such as on-site fitness, banking, and cafeteria facilities are also available for added convenience.

    Other benefits include: Prescription Drug Plan, Dental Plan, Vision Plan, 401(k) Retirement Plan, Contributory Pension Plan, Life Insurance, Disability Benefits, Generous Vacation and Holidays, Parental Leave, Legal Insurance with Identity Theft Protection, Employee Assistance Plan, Flexible Spending Accounts, Health Savings Accounts, Wellness Programs, Educational Assistance, Relocation Assistance, and Employee Discounts.

    If you have difficulty using the online application system or need an accommodation to apply due to a disability, please email ORNLRecruiting@ornl.gov.

    This position will remain open for a minimum of 5 days, after which it will close when a qualified candidate is identified and/or hired.

    We accept Word (.doc, .docx), Adobe (unsecured .pdf), Rich Text Format (.rtf), and HTML (.htm, .html) up to 5MB in size. Resumes from third party vendors will not be accepted; these resumes will be deleted and the candidates submitted will not be considered for employment. If you have trouble applying for a position, please email ORNLRecruiting@ornl.gov.

    ORNL is an equal opportunity employer. All qualified applicants, including individuals with disabilities and protected veterans, are encouraged to apply. UT-Battelle is an E-Verify employer.
