Data Engineer

Cyborgwave
Columbus, OH, US
Full-time

Job Description

Job Description

Title : Data Engineer (Python, PySpark and AWS)

Location : Columbus, OH

Client : Mphasis / JPMC

Type : Contract

Position summary

A Data Engineer at CMS is a software engineer with proficiency in data. The data engineer will build and maintain the CMS data warehouse which is used for both reporting and analytics across the company.

The individual works cross functionally with technical and business teams to identify opportunities to better leverage data.

The data comes from a variety of sources and it is the responsibility of the data engineer to make sense of the data using cloud based systems (AWS) and provide a reliable and structured format to meet the different business needs at CMS.

Duties and responsibilities

Collaborate with the team to build out features for the data platform and consolidate data assets

Build, maintain and optimize data pipelines built using Spark

Advise, consult, and coach other data professionals on standards and practices

Work with the team to define company data assets

Migrate CMS’ data platform into Chase’s environment

Partner with business analysts and solutions architects to develop technical architectures for strategic enterprise projects and initiatives

Build libraries to standardize how we process data

Loves to teach and learn, and knows that continuous learning is the cornerstone of every successful engineer

Has a solid understanding of AWS tools such as EMR or Glue, their pros and cons and is able to intelligently convey such knowledge

Implement automation on applicable processes

The ideal candidate

5+ years of experience in a data engineering position

Proficiency is Python (or similar) and SQL

Strong experience building data pipelines with Spark

Strong verbal written communication

Strong analytical and problem solving skills

Experience with relational datastores, NoSQL datastores and cloud object stores

Experience building data processing infrastructure in AWS

Bonus : Experience with infrastructure as code solutions, preferably Terraform

Bonus : Cloud certification

Bonus : Production experience with ACID compliant formats such as Hudi, Iceberg or Delta Lake

Bonus : Familiar with data observability solutions, data governance frameworks

18 days ago
Related jobs
Promoted
Manifest Solutions
Westerville, Ohio

Understanding of the benefits of data warehousing, data architecture, data quality processes, data warehousing design and implementation, table structure, fact and dimension tables, logical and physical database design, data modeling, reporting process metadata, and ETL processes. Multimodal Data Ma...

Promoted
Canonical - Jobs
Columbus, Ohio

The data platform team is a collaborative team that develops a full range of data stores and data technologies, spanning from big data, through NoSQL, cache-layer capabilities, and analytics; all the way to structured SQL engines. The data platform team is responsible for the automation of data plat...

SynergisticIT
Columbus, Ohio

Java Full stack developers, Python/Java developers, Data analysts/ Data Scientists, Machine Learning engineers. ...

Pfizer
Remote, Ohio, United States
Remote

Expert-level experience in data architecture, data types/formats, optimizing code and ETL jobs, data architecture, data modeling, data contracts - a key requirement for successful collaboration with data architects and visualization developers. The team charter includes supporting Data Science and I...

WELLS FARGO BANK
Columbus, Ohio

Wells Fargo is seeking a Lead Information Security Engineer in Technology as a part of Chief Technology Office. Work with partner engineering teams on identification and remediation of security vulnerabilities and may also conduct risk assessments of infrastructure to ensure compliance with corporat...

Vantage Data Centers
New Albany, Ohio

Vantage is looking for an ambitious and self-starting Senior Project Engineer to drive core project coordination efforts across a wide spectrum of ongoing construction projects for the Ohio market. Bachelor of Science in Construction Management, Architecture, Engineering, or similar field, or equiva...

QTS Data Centers
New Albany, Ohio

Configuration of DCIM system applications in support of new data center deployments, new data requirements, and data center retrofits. BS in Electrical Engineering, Engineering Technology, or other related Engineering degree or equivalent professional experience. DCIM Engineer II / Controls Engineer...

Amazon
Lockbourne, Ohio

Amazon Senior Data Engineer - Lockbourne, Washington. Come build the future as a Senior Data Engineer at Amazon, where you will be inspired working alongside best-in-class inventors and innovators! You will have the opportunity to create meaningful experiences that deliver on the ever-evolving needs...

Crowe
Columbus, Ohio

We are seeking a skilled Data Science Engineer who will be responsible for overseeing the maintenance and improvement of current machine learning implementations within our Revenue Cycle line of business. You will play an important role in building out the data science practice within the engineerin...

Belcan
Hilliard, Ohio

The Data Center Technical Operations Engineer, Facility will be responsible for Data Center Engineering Operations within the Client Data Center including risk management and mitigation, corrective and preventative maintenance of critical infrastructure, vendor management and metric reporting. Data ...