Search jobs > Alhambra, CA > System administrator

HPC System Administrator

InsideHigherEd
Alhambra, California, USA
$158.8K a year
Full-time

HPC System Administrator

University of California Los Angeles

Budgeted Pay Scale :

Full Salary Range : USD $76,200.00 / Yr. - USD $158,800.00 / Yr.

Department Summary

UCLA's Office of Advanced Research Computing (OARC) melds expert staff and technical infrastructure to amplify and accelerate the impact of UCLA research in the age of networked data and computation.

OARC expertise and resources are available to all UCLA researchers, who are engaged in digital research and scholarship.

We work with faculty, student, and postdoctoral researchers; instructors; and staff and administrators. OARC is a relationship-building organization.

We enable digital scholarship through collaborations, partnerships, and networked communities to advance cutting-edge research capabilities at UCLA and beyond.

OARC supports and enhances the university mission of education, research, and service through the development and execution of innovative and sustainable technology practices, programs, services, infrastructure, policies, and partnerships.

Position Summary

HPC System Administrator

UCLA's Office of Advanced Research Computing (OARC) supports and enhances the university mission of education, research, and service through the development and execution of innovative and sustainable technology practices, programs, services, infrastructure, policies, and partnerships.

The OARC High Performance Computing (HPC) Systems Research Technology Group (RTG) supports thousands of UCLA researchers and over 300 research groups through consultation and the operation of the Hoffman2 High Performance Research Cluster.

More information on the Hoffman2 cluster may be found at Hoffman2 Cluster Documentation

The Hoffman2 cluster environment consists of approximately 1000 compute nodes, GPU nodes, high speed networking, high-performance storage, backup equipment, and extensive hardware and software support infrastructure, spread across multiple data centers.

The HPC System Administrator, as part of the HPC team, will serve as a technical expert supporting OARC's HPC environment in the areas of systems and application software development, HPC cluster system administration and management of the backup system environment.

Requires the ability to work from UCLA's Westwood campus as operational demands dictate. FlexWork / hybrid schedules will be considered based on work demands and operational needs.

Salary & Compensation

UCLA provides a full pay range. Actual salary offers consider factors, including budget, prior experience, skills, knowledge, abilities, education, licensure and certifications, and other business considerations.

Salary offers at the top of the range are not common. Visit UC Benefit package to discover benefits that start on day one, and UC Total Compensation Estimator to calculate the total compensation value with benefits.

Qualifications

3 years Experience with software and applications development, Linux system administration, and two or more modern programming languages (e.

g. Python, C++, Java). (Required)

Expert knowledge of Python, SQL, bash, git, and associated build systems, libraries, and development tools. Demonstrated knowledge of common programming paradigms (e.

g., asynchronous, concurrent, and object-oriented). Demonstrated ability to create high-quality system tools and software. (Required)

Ability to work independently or in a development team, and effectively estimate time and effort required to complete tasks.

Ability to analyze, benchmark, debug, and test software in a technically sound manner and to generate clear, readable reports and summaries. (Required)

  • Demonstrated working knowledge of HPC cluster architectures and concepts (e.g., provisioning, benchmarking, scalability, and parallelizing code) and ability to stay current with industry best practices. (Required)
  • Detailed knowledge of Red Hat Enterprise Linux and related distributions. Solid system administration skills including scripting, pipelines, and UNIX operating system fundamentals. (Required)
  • Working knowledge of protocols, applications, and formats including, but not limited to, TCP / IP, HTTP, DHCP, SSH, NFS, JSON, XML, and HTML. (Required)
  • Demonstrated ability to troubleshoot and debug computing problems including, corrupted data, file management, application software, and operating system problems.

Accurately, and independently respond to production problems in multiple complex operating systems and software components. (Required)

  • Knowledge of validation, verification, and disaster recovery capabilities for both hardware and software. (Required)
  • Demonstrated skill in writing well-organized, complete, and technically and grammatically correct documents and procedures to be used by technical and non-technical personnel of diverse backgrounds at various levels in the organization, including researchers, peers, and management. (Required)
  • Demonstrated oral communication and presentation skills sufficient to effectively obtain and impart technical information and explain concepts on a one-to-one basis as well as in meetings with or presentations to multiple clients. (Required)
  • Demonstrated problem-solving skills and the ability to break down and define complex problems, formulate solutions, identify cause and effect relationships, make appropriate decisions, and communicate concepts clearly and appropriately with researchers and peers. (Required)
  • Ability to prioritize tasks, prepare project plans, schedules, effectively manage projects in areas of responsibility, complete tasks, projects in a timely manner.

Work effectively both independently and as part of a team, follow through follow through on assignments with minimal direction. (Required)

Demonstrated skill in establishing and maintaining cooperative working relationships with staff, students, and vendors.

Ability to communicate and interact effectively with persons of diverse backgrounds. (Required)

Education, Licenses, Certifications & Personal Affiliations

  • Bachelor's Degree Bachelor's degree in computer science, software engineering, or a related field. (Required) And
  • Master's Degree Master's degree in computer science, software engineering or a related field . (Preferred)

Special Conditions for Employment

  • Background Check : Continued employment is contingent upon the completion of a satisfactory background investigation.
  • Live Scan Background Check : A Live Scan background check must be completed prior to the start of employment.
  • COVID and Flu Vaccinations : The position is subject to providing evidence of inoculation.

Schedule

8 : 00 am to 5 : 00 pm

Union / Policy Covered

99-Policy Covered

To apply, please visit : https : / / jobs.ucla.edu / careers-home / jobs / 3480

Application Deadline : 8 : 50 p.m. on

Copyright 2024 Jobelephant.com Inc. All rights reserved.

Posted by the FREE value-added recruitment advertising agency

jeid-9a1732bc4c05994cbbee82f5c767dd7f

4 days ago
Related jobs
Promoted
InsideHigherEd
Los Angeles, California

The HPC System Administrator, as part of the HPC team, will serve as a technical expert supporting OARC's HPC environment in the areas of systems and application software development, HPC cluster system administration and management of the backup system environment. The OARC High Performance Computi...

Promoted
University of California - Los Angeles (UCLA)
Los Angeles, California

HPC System Administrator UCLA's Office of Advanced Research Computing (OARC) supports and enhances the university mission of education, research, and service through the development and execution of innovative and sustainable technology practices, programs, services, infrastructure, policies, and pa...

Promoted
JobLookup
Los Angeles, California

The HPC System Administrator, as part of the HPC team, will serve as a technical expert supporting OARC's HPC environment in the areas of systems and application software development, HPC cluster system administration, and management of the backup system environment. The OARC High Performance Comput...

Promoted
Northrop Grumman
Los Angeles, California

Information Technology Professionals, We Want You! * The Northrop Grumman Classified Solutions team is seeking an experienced *Linux Systems Administrator* to join its dynamic team of technical professionals. Roles and responsibilities will include but not be limited to the following: * Perform as p...

Promoted
InsideHigherEd
East Los Angeles, California

Network and Engineering Services manages the engineering and operations of the network infrastructure (e. Internet of Things devices) to the network, network access management, securing access to networks, and appropriate authentication (e. This role will manage and provide day-to-day network engine...

Promoted
Raytheon
Pasadena, California

The Test Equipment Engineering (TEE) organization includes all of the engineering disciplines responsible for systems design & test of all Raytheon products. As a Senior Systems Engineer, you are accountable to coordinate across all engineering teams to define, design, and document Test Environment ...

Promoted
VirtualVocations
Los Angeles, California

A company is looking for a Cloud DevOps Engineer to ensure the reliability, scalability, and performance of their gateway infrastructure. ...

Promoted
GCR Professional Services
Hawthorne, California

The manager is looking for someone who has worked as a Software DevOps Engineer for the last several ; Prefers 8-10+ years of ;. ...

Promoted
Optomi
Los Angeles, California
Remote

This person design our revamped network infrastructure with the goal of maximizing our network performance. Network support and troubleshooting experience for a large network with many users/customers. Optomi, in partnership with an industry leader, is seeking a Network Engineer for a remote positio...

Promoted
World Wide Technology
Rosemead, California

Five years of experience in Network Tool Administration and Network Operations management including hands-on work experience monitoring, maintaining, configuring, and upgrading Networking devices (routers and switches), wireless, or telecommunications infrastructure. Role: Network Tools Administrato...