Senior UNIX HPC Systems Administrator

University of Maryland Baltimore County
Baltimore, MD
$121K a year
Full-time

Department :

The has approximately 85 full time staff members and 100 undergraduate or graduate students that work each day to support the mission of UMBC through the delivery of IT services to the campus.

DoIT provides robust, secure IT environments that enable solutions for advancing the UMBC community. We do this through a staff of technology professionals that are connected both nationally and to UMBC.

Learn more about our work at

Position Overview :

The Senior UNIX HPC Administrator is part of a team that is responsible for the day-to-day operations of the research computing infrastructure managed by the Division of Information Technology.

The position will utilize modern system administration techniques such as orchestration, automation, and containerization to build software, deploy updates, provision new systems, manage configuration states and troubleshoot existing deployments.

Why Work at UMBC?

UMBC offers competitive compensation. This role starts at $121,000 and has over 4 weeks of vacation for regular full time roles.

Tuition remission is also available.

What is it like to work at UMBC? Check out or And read about our recent award,.

Telework :

A hybrid telework schedule is available!

Responsibilities :

Specific responsibilities include :

  • Work closely with various IT groups and departments, including but not limited to, Networks, Windows Administration, Security, Application, and Middleware Administration to assist in overall architectural design, implementation and troubleshooting
  • Develop and maintain operational guidelines for the maintenance and support of the HPC / Research environments
  • Design, troubleshooting, and maintenance of the following Software : NVIDIA Bright Cluster Manager, Red Hat Enterprise, Slurm
  • Linux system administration, security patching, OS upgrades, troubleshooting problems, and ensuring maximum availability
  • Work both independently and collaboratively with teams to troubleshoot service issues
  • Assist researchers with software builds, environment configuration, and technical support
  • Provide excellent customer service skills and demonstrate the ability to work with all levels within the organization, assuring prompt, and effective responses to customer needs
  • Utilize standard communication, reporting and documentation tools to effectively and efficiently communicate with the team, and document technical solutions
  • Help develop project plans, effectively create / update issues and keep team leads and management informed of changes, impediments, and updates
  • Perform additional duties as assigned

Required Minimum Qualifications :

Bachelor's Degree with at least 5-7 years experience working in a UNIX system administrator or engineering role with direct experience with at least some HPC technologies.

Some of these technologies include : Cluster managementParallel computingSlurm or other workload managersGPU programming and supportBuilding software for multiple computing architectures

  • Experience with NVIDIA Bright Cluster Manager or other cluster management software
  • Experience with versioning tools such as Git or Subversion
  • Install and / or configuration of CEPH, parallel or high performance file systems
  • Slurm or other cluster computing job management
  • Server class hardware deployment and remote management
  • Experience in the installation, maintenance, operation, tuning and troubleshooting of Linux and related systems and software
  • Ability to install, modify, integrate, and configure commercial and open source software applications and utilities
  • Experience supporting customer requests and working with stakeholders to gather and fulfill project requirements
  • Capable of managing time effectively, working both independently and as part of a team
  • Enthusiasm for learning new skills and adapting to a dynamic environment
  • Strong interpersonal skills, enthusiasm for customer service, and the ability to work with students, staff, and faculty from diverse backgrounds
  • Excellent written and verbal communication skills

Preferred Qualifications :

  • Bachelor's Degree preferably in Computer Science, Information Systems or related field
  • HPC knowledge around cluster builds, software, parallel computing, workload management, and cluster management.
  • Knowledge with CUDA Programming Workflows, GPU programming and GPU support.
  • Experience with GPU and specialized hardware for Artificial Intelligence and Machine Learning
  • Experience with Infiniband networking
  • Hypervisor and virtualization technologies including, but not limited to VMware, KVM and Docker

Background Screening Statement :

A background check will be required.

30+ days ago
Related jobs
Promoted
Nightwing
Annapolis Junction, Maryland

The Systems Administrator will provide support for implementation, troubleshooting and maintenance of Information Technology (IT) systems. Configures and manages UNIX and Windows operating systems and installs/loads operating system software, troubleshoots, maintains integrity and configures network...

Promoted
Peraton
Fort Meade, Maryland

Support the design of systems, mission architecture and associated hardware. Minimum Twelve (12) years' experience as a System Administrator on programs or contracts of similar scope, type, and complexity is required. Experience in leading a technical team (system administrators, ISSEs, etc). ...

Promoted
Lockheed Martin
Annapolis Junction, Maryland

As a Cloud Linux Systems Administrator, you'll be empowered to create 'new realities' and pioneer solutions that break boundaries. You are a Cloud/Linux Systems Administrator who will support the DCS TTO Integration Team. A High School Diploma or GED plus twenty (20) years of experience as a systems...

Promoted
Booz Allen Hamilton
Fort Meade, Maryland

Air Force? We’re looking for a Systems Administrator with a solid background in Windows systems administration and applying systems management tools to help us operate and maintain a key coalition test and training environment. As a Systems Administrator on our project, you’ll assign personnel to ta...

Promoted
Peraton
Laurel, Maryland

Configure and manage LINUX, UNIX, and Windows operating systems and installs/loads operating systems software, troubleshoot, maintain integrity of and configure network components, along with implementing operating systems enhancements to improve reliability and performance. Peraton is hiring a Seni...

Promoted
ManTech
Annapolis Junction, Maryland

Providing support for implementation, troubleshooting and maintenance of Information Technology (IT) systems. Management of IT system infrastructure and any processes related to those systems. Providing support to IT systems including day-to-day operations, monitoring, and problem resolution for cli...

Promoted
Leidos Inc
Columbia, Maryland

Bachelor's degree in system engineering, Computer Science, Information Systems, Engineering Science, Engineering Management, or a related technical field and eight (8) years of experience as a systems administrator. Leidos is currently seeking a Systems Administrator to join our prime Leidos contrac...

Omega Enterprise Solutions, LLC
Annapolis Junction, Maryland

Senior HPC Systems Administrator. HPC farm systems, HPC MPP clustered systems, Front End servers of Special Purpose devices (SPDs) IBM of HP Blade servers with FC/SAS/Network back end. HPC SYSTEM ADMINISTRATOR IV shall have a Bachelor’s degree in Computer Science or related field, and have ten years...

Orison Solutions
Ellicott City, Maryland

Role Description:The Senior AWS Systems Administrator will provide support for the daily operations and maintenance of a federal customer network, including Linux and Windows AWS EC2 instances, S3 storage buckets, Syslog, and CloudWatch logging. Duties & Responsibilities:<br /><br />...

Belay Technologies
Annapolis Junction, Maryland

Candidates are required to have the following skills: Experienced with Linux, Windows, VMWare Support the development of Service Now Workflows, deploy Service Now into the client environment Provides support for implementation, troubleshooting and maintenance of IT systems Manages the daily act...