Overview
Lead High Performance Computing Engineer. GW Information Technology (GW IT) provides empowering tools and caring support for all members of The George Washington University (GW). RTS (Research Technology Services) supports cyberinfrastructure systems and services to enable GW's research mission, including HPC clusters, cloud infrastructure, storage platforms, and analytics tools. The Lead HPC Engineer collaborates with a team of HPC engineers and researchers to design, implement, and maintain HPC systems, align services with strategic goals, and provide advanced technical leadership and mentorship within RTS.
Responsibilities
- Design, implement, and maintain high-performance computing systems to meet the computational needs of RTS.
- Lead operations of multiple HPC systems, contribute to strategic planning for next-generation services, and act as highest-tier escalation for operational issues.
- Engage with GW research communities to define and deliver HPC and related compute and storage infrastructure supporting evolving research needs.
- Develop and conduct advanced training; mentor other engineers to enhance interdisciplinary capabilities within Research Technology Services.
- Collaborate across HPC, data management, cloud, cybersecurity, networking, and software teams to adopt new technologies and ensure robust, scalable services.
- Lead or participate in outreach, education, and workshops to update users on HPC developments, tools, and resources.
Areas of Focus
Research Computing & Data : categorize demands, match to platforms (cloud, HPC, HTC), assist researchers with compliance in handling restricted data, and develop knowledge base resources.High-Performance Computing & Big Data : implement job schedulers and data transfer solutions, design scalable clusters and storage, integrate HPC with big data platforms (e.g., Hadoop, Spark), optimize workflows, and support ongoing enhancements.Cloud Computing : architect and manage cloud-based HPC solutions (AWS, Azure, Google Cloud), migrate workloads to cloud, ensure hybrid integration, monitor and maintain cloud performance and availability.Networking : support high-speed networking (InfiniBand, Ethernet), ensure secure data transfers, implement security practices, and troubleshoot connectivity.Security & Identity : collaborate with security teams to assess risks, implement security postures, and manage identity across research infrastructure.Data Management & Storage : manage large data volumes, optimize performance and disaster recovery, and propose new storage solutions.Applications & Collaboration : ensure applications run efficiently, provide training, deploy and upgrade applications, and support user needs.Qualifications
Bachelor's degree in a related field with 5 years of relevant experience, or Master's degree with 3 years of relevant experience (degree must be conferred by start date). Substitutions permitted via equivalent combination of education and experience.Preferred : large-scale production HPC experience; strong knowledge of HPC concepts, including parallel architectures, Slurm, InfiniBand / Ethernet; programming in scientific computing languages; experience with HPC storage systems (Lustre, GPFS) and data management; leadership and communication skills; knowledge of security best practices in HPC; excellent written and verbal communication; ability to adapt to changing priorities; strong analytical and troubleshooting skills; scripting (Perl, Python, Bash); experience with Linux kernel modules related to Lustre, NVIDIA GPUs, Mellanox InfiniBand; familiarity with Slurm and other schedulers; virtualization for image management; ticketing and SLA familiarity.Hiring Details
Hiring range : $92,790.58 – $150,696.60Campus : GW Ashburn with option for Foggy Bottom; travel between campuses may be expected.Work arrangement : Hybrid; on-site required in some capacity; telework available.Background checks : criminal history, education / degree / certifications verification, SSN trace, sex offender registry search.Special instructions : Employment visa sponsorship not available for this role; internal GW applicants only.EEO Statement : The university is an Equal Employment Opportunity employer that does not unlawfully discriminate in any of its programs or activities on the basis of race, color, religion, sex, national origin, age, disability, veteran status, sexual orientation, gender identity or expression, or any other basis prohibited by applicable law.
J-18808-Ljbffr