Data Engineer

Ladders
San Francisco, CA
Full-time

Job Description

Position : Data Engineer

Department : Data Informatics

Reports to : Chief Data Officer

SUMMARY AND DETAILS OF POSITION

Quantum Leap Healthcare Collaborative's (QLHC) Bioinformatics team provides strategic planning, integration, execution, build, and oversight of clinical trial deliverables.

The Bioinformatics group integrates structured and unstructured data across data sources, handles setup and data transfer / review, and supports downstream transformation and analysis.

The Data Engineer leads the data architecture for clinical and biomarker data pre-processing and transformations for loading into analytical platforms accessed by statisticians, analysts, and external users.

The Data Engineer contributes to the successful conduct of QLHC's clinical trials and to the prompt delivery of high-quality data, which is eventually used for statistical analysis and submitted to regulatory authorities for the approval of QLHC's products.

Principal Responsibilities

  • Program, configure, and maintain the data pipelines that conform to the common data model, and ensure data ingestion for all study-level data capture technologies and other related vendor systems and / or applications (e.g., LIMS, EDC, IRT, ePRO, eCOA).
  • Collaborate cross-functionally, facilitate test data transfers, and confirm accurate DTA specifications.
  • Perform tasks to configure and maintain data flow integration between collected data and the clinical data repository (CDR).
  • Configure data extraction and transformations in an individual contributor role across multiple data sources at the study level.
  • Partner closely with internal / external stakeholders and data engineers.
  • Ensure accurate delivery of data format and data frequency with quality deliverables per specification.
  • Participate in the development, documentation, testing, maintenance and training rendered by standards and other functions on transfer specs and best practices used by the business.

Preferred Education and Experience

  • Bachelor's degree in computer science, statistics, biostatistics, mathematics, biology, or another health-related field, plus 2+ years' experience, or equivalent experience that provides the skills necessary to perform the job.
  • You will bring experience with EDC build, Data Management, and EDC extraction configuration.
  • You will bring knowledge of data flow between clinical data management systems, vendor devices and CDR.
  • Knowledge of XMLS, ALS, APIs and MDR is preferred, along with experience with one of these languages : SQL, SAS, R, Python.
  • Strong working knowledge of clinical trial terminology and data transfer specifications is expected.
  • Proven ability to work independently and collaboratively.
  • Experience with EDC build or data extraction configuration, and with ETL / ELT.
  • Understanding of AWS / Databricks concepts.

Other Preferred Skills

  • SAS, R, or Python certification is preferred.
  • Demonstrated ability to lead projects and work groups. Project management skills.
  • Travel requirements : 1-2 domestic trips annually.
  • Experience developing R Shiny and Python apps.
  • Experience with Hadoop.
  • Experience with Agile development methods.

Disclaimer : This job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee.

Duties, responsibilities, and activities may change, or new ones may be assigned at any time with or without notice.

30+ days ago