Search jobs > New York, NY > System engineer

Scientific Systems-HPC Engineer - PACC (CPMI)

Columbia University
New York, NY, United States
$150K-$175K a year
Full-time
  • Job Type : Officer of Administration
  • Regular / Temporary : Regular
  • Hours Per Week : 35
  • Salary Range : $150,000 - $175,000

The salary of the finalist selected for this role will be set based on a variety of factors, including but not limited to departmental budgets, qualifications, experience, education, licenses, specialty, and training.

The above hiring range represents the University's good faith and reasonable estimate of the range of possible compensation at the time of posting.

Position Summary

The Scientific Systems-HPC Engineer, reporting to the Director of Bioinformatics and the Director of the Columbia Precision Medicine Initiative, is responsible for supporting the computational needs of genomics analysis and bioinformatics research.

This role involves maintaining analysis pipelines, deploying and operating High-Performance Computing (HPC) environments, and managing associated applications.

Responsibilities

  • Collaborate with the Director of Bioinformatics, Software Engineers, Data Scientists, InfoSec, and Application Administrators to design, deploy, and support research systems within a dynamic HIPAA / PHI environment.
  • Partner with stakeholders to design, build, configure, and support bioinformatics systems, HPC clusters, and batch compute workflows in our cloud environment.
  • Ensure the performance, scalability, and security of analysis pipelines and HPC clusters in collaboration with business application owners and technical resources.
  • Diagnose and resolve system, software, and storage issues in coordination with CUIMC and vendor teams.
  • Monitor system performance, analyze log files and data outputs, and manage issue triage using a ticketing system.
  • Utilize automation tools to deploy infrastructure and applications.
  • Implement patching solutions, backup, and disaster recovery plans, ensuring secure data handling and storage.
  • Propose and implement cost optimization strategies for application and system deployments.
  • Consider cloud infrastructure, application design, and monitoring when designing cloud solutions.
  • Establish security policies and best practices in collaboration with InfoSec.
  • Participate in code reviews and design discussions to better understand deployed systems.
  • Conduct disaster recovery readiness assessments, including periodic tabletop exercises.
  • Evaluate existing bioinformatics and HPC infrastructure, proposing enhanced computing architectures and conducting proof-of-concept tests.
  • Perform design reviews and risk assessments for new applications integrating with core services.
  • Guide product engineering teams in adopting security standards within the software development lifecycle.
  • Create and maintain documentation for new and existing processes and deployments.
  • Perform other duties as assigned.

Minimum Qualifications

  • Bachelor's degree or equivalent in education and experience, plus four years of related experience
  • Experience in an HPC infrastructure environment.
  • Experience with cloud-native solutions across an enterprise.
  • Professional certifications (e.g., AWS Solution Architect, Google Cloud Architect, Red Hat Linux).
  • Ability to work effectively in a team as well as independently on projects and tasks.
  • Strong experience in supporting genomic analysis pipelines and HPC clusters.
  • Experience with architecture, migration, and deployment of on-prem applications in AWS / GCP and hybrid environments.
  • Experience in migrating on-prem applications to cloud-native solutions.
  • Excellent written and verbal communication skills.
  • Ability to work in a fast-paced, deadline-driven environment with changing priorities and multiple projects.
  • Precision and attention to detail are essential.
  • Ability to work with minimal supervision and mentor junior engineers.
  • Willingness to work weekends and off-hours as needed.

Preferred Qualifications

  • Experience in stakeholder management within complex organizations, with a consultant mindset to ensure client satisfaction and timely delivery of bioinformatics / HPC solutions.
  • In-depth knowledge of Linux, system administration tools, and system security.
  • Proficiency in at least one scripting or programming language (e.g., Bash, Java, Python, Go).
  • Knowledge of programming languages such as C++ is a plus.
  • Knowledge of orchestration tools (e.g., Ansible, Chef, Puppet) and the ability to automate processes and workflows.
  • Experience managing HPC environments and using tools like Slurm, SGE, MemVerge, or AWS Parallel Cluster.
  • Experience deploying workloads in cloud environments such as AWS or GCP.
  • Familiarity with storage solutions like AWS EFS or NetApp, and associated protocols (e.g., CIFS / SMB, Lustre, NFS).
  • Experience with database systems such as MySQL, PostgreSQL, and NoSQL options like DynamoDB, MongoDB / DocumentDB.
  • Proficiency with Infrastructure as Code tools such as AWS CloudFormation and Terraform.
  • Experience using source control tools like AWS CodeCommit, GitHub, or GitLab.
  • Familiarity with container solutions such as Docker and Kubernetes.
  • Experience working in diverse research and healthcare research environments.
  • Ability to conduct Total Cost of Ownership (TCO) analysis and consider security in architecture design and deployment.
  • Understanding of cloud deployment pipelines and open-source technologies.
  • 8+ years of experience in infrastructure engineering and Linux support.

Other Requirements

Successful completion of applicable compliance and systems training requirements

Equal Opportunity Employer / Disability / Veteran

Columbia University is committed to the hiring of qualified local residents.

10 days ago
Related jobs
Promoted
Columbia University
New York, New York

The Scientific Systems-HPC Engineer, reporting to the Director of Bioinformatics and the Director of the Columbia Precision Medicine Initiative, is responsible for supporting the computational needs of genomics analysis and bioinformatics research. Collaborate with the Director of Bioinformatics, So...

Promoted
Disney Entertainment & ESPN Technology
New York, New York

The DEE Technology Productivity Engineering team is seeking a Software Engineer who has a true passion for using software engineering to build quality into software applications. This engineer will help us develop tools and write tests that support a large variety of Disney software products on web,...

Promoted
Capital One
New York, New York

Senior Software Engineer, Backend (AWS, Python). We are seeking a Senior Software Engineer who is passionate about marrying data with emerging technologies. As a Capital One Senior Software Engineer, you’ll have the opportunity to be on the forefront of driving a major transformation within Capital ...

Promoted
Scale AI, Inc.
New York, New York

We are seeking a highly skilled Infrastructure Security Engineer to join our team. You will be responsible for securing large cloud environments, orchestrating and securing various compute clusters, and reviewing infrastructure as code. Your expertise in cloud security, infrastructure automation, an...

Promoted
Cloud 88 Inc
New York, New York

Location:Hybrid schedule at any of the following locations: Pittsburgh, PA; Jersey City, NJ; or New York, NY; Will require night and weekend work....

Promoted
GNY Insurance Companies
New York, New York

As part of the IT Operations team, the Network Administrator will be part of the team responsible for supporting the network infrastructure and related systems. Network Administrator supporting firewalls, switches and other related network devices and tools. This position is operationally responsibl...

Promoted
VirtualVocations
Queens, New York

Key Responsibilities:Responsible for critical infrastructure componentMaintain, troubleshoot, and improve Linux systemsAutomate infrastructure code using Ansible and TerraformRequired Qualifications:Minimum 5 years of Linux and server administration experienceIn-depth knowledge of Linux systems, pre...

Promoted
Diverse Lynx
New York, New York

AWS DevOps CloudOps , CICD pipelines, Python scripting, GitHub, Infrastructure as Code (IaaC) tools (e. ...

Promoted
Amazon.com
New York, New York

As a Lead Software Development Engineer, it’s up to you to define, design and refine the tech that keeps us one step ahead of listeners. As a Lead Software Development Engineer, you will. Write high-quality, efficient, testable code and recommend improvements in development, maintenance, and system ...

Promoted
Two95 International Inc.
New York, New York

Rate: $Open Responsibilities and Duties: Deploying, automating, maintaining and managing AWS cloud-based production system, to ensure the availability, performance, scalability, and security of productions systems.Build, release and configuration management of production systems.Automated applicat...