Cloud Data Engineer with Databricks

Diaconia LLC
Statewide, VA, USA
$120K-$140K a year
Full-time

Diaconia is looking for a talented Cloud Data Engineer to join our amazing team!

If you're looking to join a company that truly appreciates you and your talents, look no further! At Diaconia, we are committed to serving and caring for our colleagues, our clients and our community.

Our team is made up of talented individuals who appreciate having the opportunity to contribute their knowledge and experience to further the growth and development of our industry.

Our ideal candidates embrace diverse thinking, enjoy partnering with others and are seeking to make a difference!

We are currently searching for a new, full-time member for our team for the position of :

Cloud Data Engineer with Databricks

U.S. Citizenship is REQUIRED

The Cloud Data Engineer supporting the Consumer Financial Protection Bureau (CFPB) is responsible for managing, developing, and maintaining data infrastructure and pipelines, with a specific focus on Databricks, to support the agency's data-driven initiatives and regulatory responsibilities. Databricks experience is required, and U.S. citizenship is required per federal requirements.

The CFPB is a U.S. government agency tasked with regulating and overseeing financial products and services to protect consumers.

Job Title : Cloud Data Engineer with Databricks

Job Summary : The Cloud Data Engineer with Databricks at the Consumer Financial Protection Bureau (CFPB) is responsible for designing, building, and maintaining data infrastructure and ETL pipelines using Databricks and cloud-based technologies.

The role plays a crucial part in enabling data-driven decision-making and ensuring data accuracy and accessibility for regulatory purposes.

Key Responsibilities :

  • Collaborate on and contribute to the architecture, design, development, and maintenance of large-scale data and analytics platforms, system integrations, data pipelines, data models, and API integrations.
  • Prototype emerging business use cases to validate technology approaches and propose potential solutions.
  • Data Pipeline Development : Design, develop, and maintain data pipelines using Databricks, Apache Spark, and other cloud-based technologies to ingest, transform, and load data from various financial institutions and sources.
  • Data Transformation : Implement data transformation processes to ensure data quality, integrity, and consistency, meeting regulatory standards. Create a transformation path for migrating data from on-prem pipelines and sources to AWS.
  • Data Integration : Integrate data from diverse sources, including financial databases, APIs, regulatory reporting systems, and internal data stores, into the CFPB's data ecosystem.
  • Data Modeling : Develop and optimize data models for regulatory analysis, reporting, and compliance, following data warehousing and data lake principles.
  • Performance Optimization : Monitor and optimize data pipelines for efficiency, scalability, and cost-effectiveness while ensuring data privacy and security.
  • Data Governance : Ensure data governance and regulatory compliance, maintaining data lineage and documentation for audits and reporting purposes.
  • Collaboration : Collaborate with cross-functional teams, including data analysts, legal experts, and regulatory specialists, to understand data requirements and provide data support for regulatory investigations.
  • Documentation : Maintain comprehensive documentation for data pipelines, code, and infrastructure configurations, adhering to regulatory compliance standards.
  • Troubleshooting : Identify and resolve data-related issues, errors, and anomalies to ensure data reliability and compliance with regulatory requirements.
  • Continuous Learning : Stay updated with regulatory changes, industry trends, cloud technologies, and Databricks advancements to implement best practices and improvements in data engineering.

Disclaimer : "The responsibilities and duties outlined in this job description are intended to describe the general nature and level of work performed by employees within this role. However, they are not exhaustive and may be subject to change or modification at any time to meet the evolving needs of the organization."

Minimum Qualifications :

  • U.S. Citizens ONLY as per Federal Requirements
  • Bachelor's or higher degree in computer science, data engineering, or a related field.
  • The Databricks Certified Data Engineer Professional certification is required. NO EXCEPTIONS!
  • U.S. Citizenship is required. NO EXCEPTIONS!
  • Minimum of 3 years of experience in the following :
  • Strong understanding of data lake, lakehouse, and data warehousing architectures in a cloud-based environment.
  • Hands-on experience with Databricks, including data ingestion, transformation, and analysis.
  • Proficiency in Python for data manipulation, scripting, and automation.
  • In-depth knowledge of AWS services relevant to data engineering such as Amazon S3, EC2, Database Migration Service (DMS), DataSync, EKS, CLI, RDS, Lambda, etc.
  • Understanding of data integration patterns and technologies.
  • Proficiency in designing and building flexible and scalable ETL processes and data pipelines using Python and/or PySpark and SQL.
  • Proficiency in data pipeline automation and workflow management tools like Apache Airflow or AWS Step Functions.
  • Knowledge of data quality management and data governance principles.
  • Strong problem-solving and troubleshooting skills related to data management challenges.
  • Experience managing code in GitHub or other similar tools.
  • Experience leveraging Postgres in a parallel processing environment.
  • Hands-on experience migrating from on-premises data platforms to a modern cloud environment (e.g., AWS, Azure, GCP).
  • Excellent problem-solving and communication skills.
  • Strong attention to detail and the ability to work independently and collaboratively.

Preferred Skills :

  • Experience with financial data or regulatory data management.
  • Experience working in Agile or DevSecOps environments and using related tools for collaboration and version control.
  • Knowledge of regulatory frameworks in the financial industry.
  • Familiarity with DevOps and CI/CD practices.
  • Experience with machine learning and AI technologies.

Clearance requirements :

  • Must be able to obtain and maintain a Public Trust clearance.
  • Must be a verifiable U.S. citizen for this federal support position.

A Cloud Data Engineer with Databricks at the CFPB plays a vital role in ensuring data accuracy, integrity, and compliance with regulatory standards, supporting the agency's mission to protect consumers in the financial sector.

The role demands expertise in Databricks, cloud technologies, and a deep understanding of data engineering principles within a regulatory context.

Applicant selected will be subject to a government security investigation and must meet eligibility requirements for access to classified information.

Diaconia is an Equal Opportunity Employer, Minorities / Females / Veterans / Disabled. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, or national origin.
