(Senior) Bioinformatics Data Engineer, Omics Pipelines, Translational and Quantitative Sciences Data Engineering

Genmab

Princeton, NJ

Full-time

The Role

The successful candidate will contribute to the mission of the global data engineering function and be responsible for many aspects of data including creation of data-as-a-product, architecture, access, classification, standards, integration, and pipelines.

Although your role will involve a diverse set of data-related responsibilities, your key focus will be on the creation of bioinformatics pipelines to process bulk and single cell genomics and transcriptomics data for the enablement and downstream interpretation of Translational and Quantitative Sciences functions, including Data Science, Translational Medicine, Precision Medicine, and Translational Research.

You will have a balance of subject matter expertise in life science data, terminology and processes and technical expertise for hands-on implementation.

You will be expected to create workflows to standardize and automate data, connect systems, enable tracking of data, implement triggers and data cataloging.

With your experience in the Research domain, you will possess knowledge of diverse assay types such as IHC, flow cytometry, cytokine data, but specialize in genomics and transcriptomics.

Your ultimate goal will be to place data at the fingertips of stakeholders and enable science to go faster. You will join an enthusiastic, agile, fast-paced and explorative global data engineering team.

Responsibilities

Design, implement and manage ETL data pipelines that process and transform vast amounts of scientific data from public, internal and partner sources into various repositories on a cloud platform (AWS)
Incorporate bioinformatic tools and libraries to the processing pipelines for omics assays such as bulk and single cell RNASeq
Enhance end-to-end workflows with automation that rapidly accelerate data flow with pipeline management tools such as Step Functions, Airflow, or Databricks Workflows in combination with specialized bioinformatics pipeline tools such as WDL, Nextflow, or Snakemake
Implement and maintain bespoke databases for scientific data (RWE, in-house labs, CRO data) and consumption by analysis applications and AI products
Innovate and advise on the latest technologies and standard methodologies in Data Engineering and Data Management, including recent advancements with GenAI, and latest bioinformatics tools, modules and techniques in RNA sequencing analysis
Manage relationships and project coordination with external parties such as Contract Research Organizations (CRO) and vendor consultants / contractors
Define and contribute to data engineering practices for the group, establishing shareable templates and frameworks, determining best usage of specific cloud services and tools, and working with vendors to provision cutting edge tools and technologies
Collaborate with stakeholders to determine best-suited data enablement methods to optimize the interpretation of the data, including creating presentations and leading tutorials on data usage as appropriate
Apply value-balanced approaches to the development of the data ecosystem and pipeline initiatives
Proactively communicate data ecosystem and pipeline value propositions to partnering collaborators, specifically around data strategy and management practices
Participate in GxP validation processes

Requirements

BS / MS in Computer Science, Bioinformatics, or a related field with 5+ years of software engineering experience (8+ years for senior role) or a PhD in Computer Science, Bioinformatics or a related field and 2+ years of software engineering experience (5+ years for senior role)
Excellent skills and deep knowledge of ETL pipeline, automation and workflow managements tools such as Airflow, AWS Glue, AWS Step Functions, and CI / CD is a must.

Strong preference specifically for AWS Step Functions and Lambda.

Excellent skills with bioinformatics pipeline tools and troubleshooting for quality such as Snakemake, WDL, and Nextflow.

Strong preference for Nextflow.

Excellent skills and deep knowledge in Python, Pythonic design and object-oriented programming is a must, including common Python libraries such as pandas.

Experience with R a plus

Excellent understanding of different bioinformatics modules and databases such as STAR, HISAT2, featureCounts, fastQC, RSeQC and Cell Ranger and how they’re used on different types of genomic and transcriptomic data such as single cell transcriptomics
Solid understanding of modern data architectures and their implementation offerings such as Databricks’ Delta Tables, Athena, Glue, Iceberg, and their applications to Lakehouse and medallion architecture.
Experience working with clinical data and understanding of GxP compliance and validation processes
Proficiency with modern software development methodologies such as Agile, source control, project management and issue tracking with JIRA
Proficiency with container strategies using Docker, Fargate, and ECR
Proficiency with AWS cloud computing services such as Lambda functions, ECS, Batch and Elastic Load Balancer and other compute frameworks such as Spark, EMR, and Databricks.

Strong preference for experience with AWS Omics.

For US based candidates, the proposed salary band for this position is as follows :

$,.00 $,.00

The actual salary offer will carefully consider a wide range of factors, including your skills, qualifications, experience, and location.

Also, certain positions are eligible for additional forms of compensation, such as bonuses.

About You

You are passionate about our purpose and genuinely care about our mission to transform the lives of patients through innovative cancer treatment
You bring rigor and excellence to all that you do. You are a fierce believer in our rooted-in-science approach to problem-solving
You are a generous collaborator who can work in teams with diverse backgrounds
You are determined to do and be your best and take pride in enabling the best work of others on the team
You are not afraid to grapple with the unknown and be innovative
You have experience working in a fast-growing, dynamic company (or a strong desire to)
You work hard and are not afraid to have a little fun while you do so

Locations

Genmab leverages the effectiveness of an agile working environment, when possible, for the betterment of employee work-life balance.

Our offices are designed as open, community-based spaces that work to connect employees while being immersed in our state-of-the-art laboratories.

Whether you’re in one of our collaboratively designed office spaces or working remotely, we thrive on connecting with each other to innovate.

30+ days ago

Related jobs

Promoted

Senior Manager, Contracts, Research & Innovation Outsourcing Management - Labs & Data Services

Genmab

Princeton, New Jersey

For more than 20 years, its passionate, innovative and collaborative team has invented next-generation antibody technology platforms and leveraged translational research and data sciences, which has resulted in a proprietary pipeline including bispecific T-cell engagers, next-generation immune check...

Senior Clinical Data Manager

Katalyst HealthCares & Life Sciences

Edison, New Jersey

Responsible for the initiation and approval of the building, testing and validation of clinical databases, subsequent changes in clinical databases and data validation activities. Contributes to upkeep company's DM outsourcing strategies and long-term relationships with outsourcing partners with obj...

Lead Software Engineer, Data Engineering

Panjiva

Princeton, New Jersey

The page you are looking for no longer exists.We’re sorry, but it looks like this job may be no longer available or does not exist.Please click to perform a new job search....

Senior Manager, Data Engineering - Enterprise Identity Services

CVS Health

Work from home, NJ, US

Remote

The data operations team will be responsible for data engineer and analysis, quality measures and improvement, data flows and forecasting, and managing our relationship with our customers. This role will lead a Data Operations team directly responsible for improving the accuracy and understanding of...

Technology and Data - Specialty Software Engineer 4 - Contingent

Mindlance

North Brunswick Township, New Jersey

Contribute to the resolution of complex and multi-faceted situations requiring solid understanding of the function, policies, procedures, and compliance requirements that meet deliverables. Job Description: In this contingent resource assignment, you may: Consult on complex initiatives with broad im...

Senior Mechanical Engineer (Hamilton)

Keystone Engineering Group Inc

Hamilton Township, New Jersey

Professional Engineering Firm and Systems Integrator with 25+ years of experience in Design and Design-Build services for Water, Wastewater, and private industries. Keystone is in search of a Senior Mechanical Engineer with 10 years+ of consulting engineering experience. As a Senior Mechanical Engin...

Sr Data Engineer

PSCI

Hopewell Township, New Jersey

Remote

Should have 5-8+ Years of IT Experience with Data Engineering and DWH, ETL Data Bases, ETL Pipelines, data migration. Good to have) Understanding and Experience with PostgreSQL, PySpark, AWS, AWS Glue, EMR etc. Oracle/ SQL, and very strong with SQL Queries. Should know some Data modelling practices....

MDM Data Engineer (with ETL Testing experience)

HAN

MDM Data Engineer (with ETL Testing experience). Looking for an MDM Data engineer with Experience in ETL Testing. Skill Matrix to be filled by Candidates:. Experience in Python and Pyspark scripting. ...

Data Engineer with BioInformaics exp (No C2C)

Xlysi

Princeton, New Jersey

Role: “Bioinformatics Data Engineers in Research”. Strong knowledge of bioinformatics databases, tools, and resources (e. Bachelor’s or master’s degree in bioinformatics, Computational Biology, Computer Science, Data Science, or a related field. Experience with data processing frameworks and tools (...

SQA & Automation Engineer - Senior Associate

State Street

Princeton, New Jersey

We’re driving the company’s digital transformation and expanding business capabilities using industry best practices and advanced technologies such as cloud, artificial intelligence and robotics process automation. Software Quality assurance and automation engineer will contribute to the manual and ...

(Senior) Bioinformatics Data Engineer, Omics Pipelines, Translational and Quantitative Sciences Data Engineering

Senior Manager, Contracts, Research & Innovation Outsourcing Management - Labs & Data Services

Senior Clinical Data Manager

Lead Software Engineer, Data Engineering

Senior Manager, Data Engineering - Enterprise Identity Services

Technology and Data - Specialty Software Engineer 4 - Contingent

Senior Mechanical Engineer (Hamilton)

Sr Data Engineer

MDM Data Engineer (with ETL Testing experience)

Data Engineer with BioInformaics exp (No C2C)

SQA & Automation Engineer - Senior Associate

Related searches