Search jobs > Cincinnati, OH > Senior data engineer

Senior Data Engineer

Gravity IT Resources
Cincinnati, OH, United States
Full-time

Job Summary : We are seeking three skilled Data Engineer to join our Data Science team. The ideal candidate will be responsible for designing, building, and maintaining scalable data pipelines and infrastructure to support data analytics, machine learning, and Retrieval-Augmented Generation (RAG) type Large Language Model (LLM) workflows.

This role requires a strong technical background, excellent problem-solving skills, and the ability to work collaboratively with data scientists, analysts, and other stakeholders.

Key Responsibilities :

  • Data Pipeline Development :
  • Design, develop, and maintain robust and scalable ETL (Extract, Transform, Load) processes.
  • Ensure data is collected, processed, and stored efficiently and accurately.
  • Data Integration :
  • Integrate data from various sources, including databases, APIs, and third-party data providers.
  • Ensure data consistency and integrity across different systems.
  • RAG Type LLM Workflows :
  • Develop and maintain data pipelines specifically tailored for Retrieval-Augmented Generation (RAG) type Large Language Model (LLM) workflows.
  • Ensure efficient data retrieval and augmentation processes to support LLM training and inference.
  • Collaborate with data scientists to optimize data pipelines for LLM performance and accuracy.
  • Semantic / Ontology Data Layers :
  • Develop and maintain semantic and ontology data layers to enhance data integration and retrieval.
  • Ensure data is semantically enriched to support advanced analytics and machine learning models.
  • Collaboration :
  • Work closely with data scientists, analysts, and other stakeholders to understand data requirements and deliver solutions.
  • Provide technical support and guidance on data-related issues.
  • Data Quality and Governance :
  • Implement data quality checks and validation processes to ensure data accuracy and reliability.
  • Adhere to data governance policies and best practices.
  • Performance Optimization :
  • Monitor and optimize the performance of data pipelines and infrastructure.
  • Troubleshoot and resolve data-related issues in a timely manner.
  • Support for Analysis :
  • Support short-term ad-hoc analysis by providing quick and reliable data access.
  • Contribute to longer-term goals by developing scalable and maintainable data solutions.
  • Documentation :
  • Maintain comprehensive documentation of data pipelines, processes, and infrastructure.
  • Ensure knowledge transfer and continuity within the team.

Technical Requirements :

  • Education and Experience :
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
  • 3+ years of experience in data engineering or a related role.
  • Technical Skills :
  • Proficiency in Python (mandatory).
  • Experience with other programming languages such as Java or Scala is a plus.
  • Experience with SQL and NoSQL databases (e.g., MySQL, PostgreSQL, MongoDB).
  • Familiarity with big data technologies (e.g., Hadoop, Spark, Kafka).
  • Experience with cloud platforms (e.g., AWS, Azure, Google Cloud) and their data services.
  • RAG Type LLM Skills :
  • Experience with data pipelines for LLM workflows, including data retrieval and augmentation.
  • Familiarity with natural language processing (NLP) techniques and tools.
  • Understanding of LLM architectures and their data requirements.
  • Semantic / Ontology Data Layers :
  • Familiarity with semantic and ontology data layers and their application in data integration and retrieval.
  • Tools and Frameworks :
  • Experience with ETL tools and frameworks (e.g., Apache NiFi, Airflow, Talend).
  • Familiarity with data visualization tools (e.g., Tableau, Power BI) is a plus.
  • Soft Skills :
  • Strong analytical and problem-solving skills.
  • Excellent communication and collaboration abilities.
  • Ability to work in a fast-paced, dynamic environment.

Preferred Qualifications :

  • Experience with machine learning and data science workflows.
  • Knowledge of data governance and compliance standards.
  • Certification in cloud platforms or data engineering.
  • 12 days ago
Related jobs
Highmark Health
OH, Working at Home, Ohio

In partnership with other business, platform, technology, and analytic teams across the enterprise, design, build and maintain well-engineered data solutions in a variety of environments, including traditional data warehouses, Big Data solutions, and cloud-oriented platforms. Align with security, da...

Huntington National Bank
Ohio

The Data Protection Engineer Senior will independently perform Data Protection engineering activities of building, configuring, troubleshooting, integrating and administrating Data protection technologies aligned to one of the Data Protection sub-domains (Data in Transit, Data at Rest, Cryptographic...

BDO
Cincinnati, Ohio

Net, C#, Qlik, Power BI, Machine Learning, Azure Data Factory, RedShift, UiPath, Cloud, RPA, AWS, Redshift, Kinesis, QuickSight, SageMaker, S3, Databricks, AWS Lake Formation, Snowflake, Python, Qlik, Athena, Data Pipeline, Glue, Star Schema, Data Modeling, SQL, SSIS, SSAS, SSRS, PySpark, Microsoft ...

Wounded Warrior Project
Cincinnati, Ohio

The Wounded Warrior Project (WWP) Senior Data Engineer is a member of the Web, Data, and Analytics team responsible for data engineering and programming to build systems that collect, manage, and convert raw data into usable information for business analysts. Build required infrastructure for optima...

LeadStack Inc.
Cincinnati, Ohio

Experience with Cosmos DB, Azure Data Explorer, Azure Synapse Analytics, Azure Data Lake, Azure Data Factory, Azure SQL, Azure Databricks, Azure Machine Learning or equivalent tools & technologies. Analyze, design and develop enterprise data and information architecture deliverables, focusing on dat...

Standard Aero
Cincinnati, Ohio

Senior Data Analytics Engineer or Specialist,. Collaborate with TAG Engineers, Engineering Directors/Managers, and other functional groups in order to:. Assist with data mining needed to formulate & support value proposition for new projects based on data from Navixa, SA Menu etc. Create skills trai...

The Judge Group
Cincinnati, Ohio

We are seeking a highly skilled and experienced Senior Data Engineer with a passion for data and expertise in Domain-Driven Design (DDD) to join our organization. In this role, you will be responsible for the technical development and maintenance of our data systems, mentoring junior data engineers,...

VSoft Consulting Group inc
Cincinnati, Ohio

Data Engineer and has helped/lead development teams in delivering high-quality data orchestration solutions with min 7+ years’ experience. Job Title: Lead Azure Data Engineer (SC Product Funding). Azure data factory, data bricks, and CICD. Technical skill:- Python, PySpark, ADF, SQL, ADLS, Microsoft...

MEDPACE
Cincinnati, Ohio

Medpace is a full-service clinical contract research organization (CRO).We provide Phase I-IV clinical development services to the biotechnology, pharmaceutical and medical device industries.Our mission is to accelerate the global development of safe and effective medical therapeutics through its sc...

Flexton Inc.
Cincinnati, Ohio

Work with Data Engineers and other stakeholders for. Data quality and data validations. DataWarehouse, Datamodeling background is a plus. Hands on with Python, Databricks pipeline, deployment and testing. ...