Data Engineer

CDC Foundation
Ohio
Full-time

Overview

The Data Engineer will play a crucial role in advancing the CDC Foundation's mission by designing, building, and maintaining data infrastructure for a public health organization.

This role is aligned to the Workforce Acceleration Initiative (WAI). WAI is a federally funded CDC Foundation program with the goal of helping the nation’s public health agencies by providing them with the technology and data experts they need to accelerate their information system improvements.

Working within the Cleveland Department of Public Health (CDPH), the Data Engineer is responsible for enabling data integration and data preparation pipelines for downstream analytics on behalf of the Office of Epidemiology and Population Health.

This role requires business intuition and the ability to apply a variety of technical and soft skills to collaborate across departments.

The Data Engineer will be hired by the CDC Foundation and assigned to the Epidemiologist responsible for informatics in the Office of Epidemiology and Population Health (OEPH).

The Data Engineer will additionally cooperate with the Office of Urban Analytics & Innovation (Urban AI) at the City of Cleveland for alignment on data infrastructure requirements and best practices for the enterprise.

This position is eligible for a fully remote work arrangement for based candidates.

Responsibilities

  • Utilize software engineering methods and tools on a common data analytic platform to integrate, process and prepare multiple sources of data for downstream public health surveillance analyses.
  • Collaborate with the Data Analyst and Epidemiologists to understand data requirements, and develop and maintain data pipelines that automate data transformation tasks.
  • Perform data linkages between public health surveillance data and geospatial data assets.
  • Document data transformation processes and maintain comprehensive records for reproducibility.
  • Test data and/or applications to validate data accuracy and quality.
  • Track projects from conceptualization to completion, including helping to create project roadmaps, project plans, and requirements documentation.
  • Create and manage the systems and pipelines that enable efficient and reliable flow of data, including ingestion, processing, and storage.
  • Collect data from various sources, transforming and cleaning it to ensure accuracy and consistency. Load data into storage systems or data warehouses.
  • Optimize data pipelines, infrastructure, and workflows for performance and scalability.
  • Monitor data pipelines and systems for performance issues, errors, and anomalies, and implement solutions to address them.
  • Implement security measures to protect sensitive information.
  • Collaborate with data scientists, analysts, and other partners to understand their data needs and requirements, and to ensure that the data infrastructure supports the organization's goals and objectives.
  • Collaborate with cross-functional teams to understand data requirements and design scalable solutions that meet business needs.
  • Implement and maintain ETL processes to ensure the accuracy, completeness, and consistency of data.
  • Design and manage data storage systems, including relational databases, NoSQL databases, and data warehouses.
  • Stay knowledgeable about industry trends, best practices, and emerging technologies in data engineering, and incorporate them into the organization's data infrastructure.
  • Provide technical guidance to other staff.
  • Communicate effectively with partners at all levels of the organization to gather requirements, provide updates, and present findings.

Qualifications

  • Bachelor's degree in computer science or information systems, or equivalent experience
  • Demonstrated ability in complex data management and data preparation, including but not limited to data storage, data standardization, and data operations, for data warehousing efforts
  • Experience working with data integration frameworks
  • Experience working with cloud services & infrastructure (Microsoft Azure Databricks preferred)
  • Experience in designing, writing, and delivering code in a team environment, using source code control, unit testing, and other software engineering principles (Java, Python, R)
  • Ability to thrive in a project-based, team environment

Preferred Skills

  • Spatial data experience (geopandas or ArcGIS)

Special Notes

This role is involved in a dynamic public health program. As such, roles and responsibilities are subject to change as situations evolve.

To best support the public health programming, the roles and responsibilities listed above may be expanded upon or updated to match priorities and needs once written approval is received by the CDC Foundation.
