Talent.com
Senior HPC Storage Systems Engineer
MartinFed • Oak Ridge, TN, US
Job type: Full-time


Company Overview

XCEL Engineering, Inc. is an award-winning small business that provides trusted information technology, engineering, consulting and project management solutions and services to federal agencies and organizations. Originally founded in 1971 by professional engineers at the University of Tennessee, XCEL was acquired in 2003 by U.S. Army and Navy veterans and in 2023 became a MartinFed company.

XCEL Engineering is a part of IT Lab Partners (ITLP) which was created to support a leading research facility in the East Tennessee region in recruiting the best and the brightest technical talent.

Job Overview

Xcel Engineering is seeking a Senior HPC Storage Systems Engineer to design, operate, and maintain the storage behind the clusters, servers, and workstations that support services where science happens at ORNL. This position resides in the Emerging Technologies & Computing team in the Research Computing group in the Information Technology Services Directorate at Oak Ridge National Laboratory (ORNL).

The Emerging Technology Computational Group advances the research community's goals through HPC systems engineering, integration, and support. By providing design, deployment, optimization, monitoring, and tooling support across multiple clustered storage infrastructures, we facilitate Lab-wide R&D projects.

Essential Functions

  • Architect, deploy, and manage large-scale HPC storage systems, including parallel file systems such as Lustre, GPFS/Spectrum Scale, BeeGFS, and WEKA.
  • Design, implement, and operate large-scale Ceph storage clusters for HPC and research workloads, delivering reliable, high-performance object, block, and file storage services.
  • Ensure the availability, performance, scalability, and security of production storage environments.
  • Administer and optimize enterprise storage platforms such as Qumulo and NetApp in support of HPC and research workloads.
  • Design, deploy, and maintain archival storage solutions including Spectra Logic BlackPearl and large-scale tape libraries to ensure long-term data preservation and accessibility.
  • Integrate high-performance, enterprise, and archival storage layers into cohesive tiered storage architectures that balance cost, scalability, and performance for diverse scientific workflows.
  • Leverage automation and monitoring solutions to minimize day-to-day maintenance while identifying opportunities to optimize system performance and management.
  • Collaborate with researchers and technical POCs to support large data workflows and optimize I/O performance for scientific workloads.
  • Automate storage provisioning, monitoring, and maintenance using scripting and configuration management tools.
  • Diagnose and resolve complex storage and I/O-related issues in high-throughput, low-latency HPC environments.
  • Evaluate emerging storage technologies (NVMe, object storage, hierarchical storage management, burst buffers) and contribute to strategic planning for future HPC systems.
  • Work with 24/7 operations staff to streamline monitoring and troubleshooting, significantly reducing the need for off-hours support.
  • Deliver ORNL's mission by aligning behaviors, priorities, and interactions with our core values of Impact, Integrity, Teamwork, Safety, and Service. Promote equal opportunity by fostering a respectful workplace.

Basic Qualifications

  • A BS degree in computer science, computer engineering, information technology, information systems, science, engineering, or related discipline and 8-12 years of relevant professional experience; or an equivalent combination of education and experience.
  • Master's degree holders: 7-10 years of relevant experience.
  • PhD holders: 4-6 years of relevant experience.
  • Five (5) or more years managing UNIX/Linux systems.
  • Demonstrated experience managing HPC storage and large-scale enterprise storage systems.
  • Three (3) or more years working with configuration management and automation tools such as Git, Jenkins, Ansible, or Puppet.
  • Proficiency with at least one scripting language (Bash, Python, Perl, etc.).
  • Strong Linux administration and advanced troubleshooting experience.
  • Experience supporting large data systems and/or HPC scientific workloads.
  • Strong desire to innovate and evaluate new technologies for HPC and storage environments.
  • Collaborative approach and ability to become a trusted advisor to research teams.

Desired Qualifications

  • Active DOE Q, DoD Top Secret, or TS/SCI clearance is strongly preferred.
  • Solid understanding of multiple operating systems and HPC cluster technologies.
  • Experience with Rocky/CentOS/RHEL, Ubuntu, VMware.
  • Understanding of HPC job schedulers (SLURM) and user support workflows.
  • Experience with container technologies in HPC environments.
  • Experience with multiple system deployment mechanisms (Warewulf, PXEboot, Cobbler, Bright).
  • Experience with GPU clusters (NVIDIA, AMD) for AI/ML and scientific workloads.
  • Deep expertise with high-performance parallel file systems (Lustre, GPFS/Spectrum Scale, BeeGFS, WEKA).
  • Knowledge of storage networking (InfiniBand, NVMe-oF, SAN/NAS architectures).
  • Familiarity with RAID, ZFS, and object storage technologies.
  • Strong background in performance monitoring, benchmarking, and I/O optimization.
  • Experience with monitoring systems such as Grafana, CheckMK, Nagios, Zabbix, Ganglia.
  • Previous experience working in a government, scientific, or other highly technical environment.
  • Strong documentation skills and ability to prepare web-based documentation.

Physical Requirements & Environmental Conditions

  • Inside office environment.
  • Working on a computer for long periods of time.
  • May involve long periods of sitting at a desk.
  • The work environment is fast-paced and sometimes involves extreme deadline pressures.

Xcel Engineering is an Equal Opportunity / Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, gender, sexual orientation, gender identity, gender expression, transgender status, pregnancy, marital status, national origin, ancestry, citizenship status, age, disability, protected veteran status, genetics, or any other characteristics protected by applicable federal, state, or local law.

If you are a qualified individual with a disability or a disabled veteran, you have the right to request a reasonable accommodation if you are unable or limited in your ability to use or access Xcel Engineering's current openings as a result of your disability. You can request reasonable accommodations by calling 855.212.1810.
