Data Engineer

CDC Foundation
Ohio
Full-time

Overview

The Data Engineer will play a crucial role in advancing the CDC Foundation's mission by designing, building, and maintaining data infrastructure for a public health organization.

This role is aligned to the Workforce Acceleration Initiative (WAI). WAI is a federally funded CDC Foundation program with the goal of helping the nation’s public health agencies by providing them with the technology and data experts they need to accelerate their information system improvements.

Working within Cleveland Department of Public Health (CDPH). The Data Engineer is responsible for enabling data integration and data preparation pipelines for downstream analytics on behalf of the Office of Epidemiology and Population Health.

This role requires business intuition and ability to use a variety of technical and soft skills necessary to collaborate across departments.

The Data Engineer will be hired by the CDC Foundation and assigned to the Epidemiologist responsible for informatics in the Office of Epidemiology and Population Health (OEPH).

The Data Engineer will additionally cooperate with the Office of Urban Analytics & Innovation (Urban AI) at the City of Cleveland for alignment on data infrastructure requirements and best practices for the enterprise.

This position is eligible for a fully remote work arrangement for based candidates.

Responsibilities

  • Utilize software engineering methods and tools on a common data analytic platform to integrate, process and prepare multiple sources of data for downstream public health surveillance analyses.
  • Collaborate with the Data Analyst and Epidemiologists to understand data requirements, develop and maintain data pipelines automating data transformation tasks.
  • Perform data linkages between public health surveillance data and geospatial data assets.
  • Document data transformation processes and maintain comprehensive records for reproducibility.
  • Test data and / or applications to validate data accuracy / quality
  • Track projects from conceptualization to completion, including helping to create project roadmaps, project plans and requirements documentation
  • Create and manage the systems and pipelines that enable efficient and reliable flow of data, including ingestion, processing, and storage.
  • Collect data from various sources, transforming and cleaning it to ensure accuracy and consistency. Load data into storage systems or data warehouses.
  • Optimize data pipelines, infrastructure, and workflows for performance and scalability.
  • Monitor data pipelines and systems for performance issues, errors, and anomalies, and implement solutions to address them.
  • Implement security measures to protect sensitive information.
  • Collaborate with data scientists, analysts, and other partners to understand their data needs and requirements, and to ensure that the data infrastructure supports the organization's goals and objectives.
  • Collaborate with cross-functional teams to understand data requirements and design scalable solutions that meet business needs.
  • Implement and maintain ETL processes to ensure the accuracy, completeness, and consistency of data.
  • Design and manage data storage systems, including relational databases, NoSQL databases, and data warehouses.
  • Knowledgeable about industry trends, best practices, and emerging technologies in data engineering, and incorporating the trends into the organization's data infrastructure.
  • Provide technical guidance to other staff.
  • Communicate effectively with partners at all levels of the organization to gather requirements, provide updates, and present findings.

Qualifications

  • Bachelor's degree in computer science or information systems, or equivalent experience
  • Demonstrated ability in complex data management and data preparation, including but not limited to data storage, data standardization, and data operations, for data warehousing efforts
  • Experience working with data integration frameworks
  • Experience working with cloud services & infrastructure (Microsoft Azure Databricks preferred)
  • Experience in designing, writing, and delivering code in a team environment, using source code control, unit testing, and other software engineering principles (, Java, Python, R)
  • Ability to thrive in a project-based, team environment
  • Preferred Skills
  • Spatial data experience, geopandas or ArcGIS

Special Notes This role is involved in a dynamic public health program. As such, roles and responsibilities are subject to change as situations evolve.

Roles and responsibilities listed above may be expanded upon or updated to match priorities and needs, once written approval is received by the CDC Foundation in order to best support the public health programming.

5 hours ago
Related jobs
Promoted
Pkaza
New Albany, Ohio

Owner's Rep - Field Engineer - Data Center - New Albany, OH. Our client is a Nationally ranked MEP Engineering Design firm that is a leader in the data center space. They provide design, commissioning, consulting and management expertise in Data Center / Mission Critical Facilities Space with th...

Promoted
Caddell Construction
OH, United States

Plans, develops, coordinates and manages onsite construction engineering activities for Commercial projects. Four-year degree in engineering or construction management preferred. ...

Promoted
VSoft Consulting Group inc
Cincinnati, Ohio

Data Engineer and has helped/lead development teams in delivering high-quality data orchestration solutions with min 7+ years’ experience. Job Title: Lead Azure Data Engineer (SC Product Funding). Azure data factory, data bricks, and CICD. Technical skill:- Python, PySpark, ADF, SQL, ADLS, Microsoft...

Promoted
CoStrategix
Blue Ash, Ohio

Collaborate with data engineers, data scientists, and other cross-functional teams to integrate data quality checks into the data pipeline and maintain quality throughout. As a Data Quality Engineer, you will play a critical role in developing and implementing data quality standards, processes, and ...

Promoted
Matlen Silver
Cincinnati, Ohio

Data Engineer and has helped/lead development teams in delivering high-quality data orchestration solutions with min 7+ years’ experience. Supply Chain Data Engineering. The ideal candidate should excel in Development or pipeline, orchestrating data, resolving connection issue, Solve production issu...

JPMorgan Chase Bank, N.A.
Columbus, Ohio

Proactively identifies hidden problems and patterns in data and uses these insights to drive improvements to coding hygiene and system architecture * Contributes to software engineering communities of practice and events that explore new and emerging technologies * Adds to team c...

Vertiv
Westerville, Ohio

Understanding of the benefits of data warehousing, data architecture, data quality processes, data warehousing design and implementation, table structure, fact and dimension tables, logical and physical database design, data modeling, reporting process metadata, and ETL processes. Understanding of e...

JPMorgan Chase & Co.
Columbus, Ohio

Extract and analyze data from JPMC data sources, evaluating the effectiveness and precision of new data sources and data collection techniques . Create bespoke data models and algorithms tailored to address Cyber Technology Group's requirements and apply them to data sets while developing and employ...

Ohio's Hospice of Dayton
Troy, Ohio

As a Data Engineer I, you will be responsible for supporting the data infrastructure and pipelines within the organization. Collaborate with data scientists and analysts to understand data requirements and ensure data quality. What you should know about the Data Engineer 1 position: . You will work ...

Wendy's
Dublin, Ohio

Experience with scalable cloud data technologies (Google Cloud preferred), examples include Dataflow, Dataproc, Dataflow, Dataprep, Cloud Composer, Big Query, Snowflake, Looker, Databricks, Hadoop, Spark, Snaplogic. The primary focus of this role is to lead the design and development of data movemen...