Xpanse - Gen AI Data Engineer

Harrison National Employment, LLC (Archwell Holdings, LLC)
FL, US
Permanent
Quick Apply

Xpanse was established with the goal of increasing access to home ownership for a broader audience. Since our launch, we've been dedicated to simplifying the mortgage lending ecosystem by building innovative software solutions.

We view home ownership as a core component of the 'American Dream,' and our products play a key role in transforming that dream into reality.

We are seeking a skilled Data Engineer to join our team in building advanced data pipelines and infrastructure to support Generative AI (GenAI) applications.

You will play a key role in designing and implementing scalable data systems that enable the efficient processing and storage of large datasets, ensuring that our AI models have access to clean, well-structured, and high-quality data.

This is an opportunity to contribute to the backbone of AI-driven solutions, such as call summarization, data compliance, and predictive analytics.

Job Requirements :

  • Data Pipeline Development : Design, build, and maintain robust and scalable data pipelines that support GenAI applications, including real-time and batch data processing.
  • Data Integration : Work closely with data scientists, machine learning engineers, and software developers to integrate structured and unstructured data from various sources, ensuring it’

s clean and ready for use in AI models.

  • Cloud Infrastructure : Deploy and manage data storage solutions on AWS, Azure, GCP, and Snowflake, optimizing for scalability, performance, and cost-efficiency.
  • Data Quality Management : Implement and monitor data quality checks to ensure data accuracy, completeness, and consistency across the entire data pipeline.
  • ETL Processes : Design and manage ETL (Extract, Transform, Load) processes that efficiently handle large datasets from different data sources, transforming data into formats suitable for analysis.
  • Collaboration : Work with cross-functional teams to understand business requirements and translate them into efficient data models and pipelines.
  • Performance Optimization : Continuously monitor and optimize data storage, retrieval, and transformation processes to ensure high performance and low latency.
  • Security & Compliance : Ensure data pipelines meet security and regulatory requirements, especially in areas like data privacy and compliance in financial services.

Qualifications :

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
  • 3+ years of experience in data engineering or a related role, with a strong focus on building and managing data pipelines.
  • Expertise in data pipeline orchestration tools such as Apache Airflow, AWS Glue, Dataflow, or similar technologies.
  • Proficiency in working with ETL frameworks and building data transformation processes.
  • Strong experience with SQL and NoSQL databases, and knowledge of modern data storage solutions like Snowflake, S3, Redshift, BigQuery, or similar.
  • Cloud Platform Experience : Strong experience working on cloud platforms (AWS, Azure, GCP) and familiarity with managing data infrastructure in a cloud-native environment.
  • Knowledge of Python, Scala, or Java for data manipulation and automation tasks.
  • Strong understanding of data security, governance, and compliance best practices.
  • Experience working with large-scale data systems supporting AI / ML models, including vector databases and unstructured data management, preferred.
  • Familiarity with MLOps and integrating data pipelines into machine learning workflows, preferred.
  • Experience with real-time data processing tools like Kafka or Kinesis, preferred.
  • Experience optimizing data architecture to support large datasets for AI / ML-driven applications, preferred.
  • Snowflake expertise : Extensive experience building and optimizing data models, managing data workflows, and ensuring scalability within Snowflake, preferred.
  • Strong problem-solving skills and the ability to work in a fast-paced, collaborative environment, preferred.
  • 1 day ago
Related jobs
Harrison National Employment, LLC (Archwell Holdings, LLC)
FL, US

We're on the lookout for a detail-oriented Quality Assurance (QA) Engineer who specializes in ensuring the highest standards of quality for next-generation Generative AI (GenAI) chatbot applications. Automated Testing Development: Design, build, and maintain automated testing frameworks tailored to ...

PwC US Group LLP
Miami, Florida
Remote

A Data Scientist collects domain context from stakeholders, defines hypothesis and prediction tasks, identifies and creates supporting data sources, conducts experiments with various algorithms to model prediction tasks, undertakes validation and tests of models to improve performance, produces pipe...

Harrison National Employment, LLC (Archwell Holdings, LLC)
FL, US

We're seeking a highly innovative Data Scientist to lead the development of advanced Generative AI (GenAI) applications at our company. Oversee data collection, preprocessing, and management, maintaining data integrity for model training. Keep abreast of AI, ML, and GenAI advancements to continually...

JPMorgan Chase & Co.
Tampa, Florida

Apply advanced principles, theories, and concepts in the realm of Artificial Intelligence (AI), Machine Learning (ML), Large Language Models (LLMs), Deep Learning (DL), Generative AI, Transfer Learning, and Reinforcement Learning algorithms to cyber data sets . Extract and analyze data from JPMC dat...

Harrison National Employment, LLC (Archwell Holdings, LLC)
FL, US

We are seeking a highly skilled Full Stack Engineer with a keen eye for UI/UX design to take a leading role in developing next-generation Generative AI (GenAI) chatbots. As a key member of our interdisciplinary team, which includes Product Managers, UX Designers, Data Scientists, and Machine Learnin...

Promoted
InsideHigherEd
Gainesville, Florida

Expertise will be necessary to design and develop scripts to transform data from multiple data sources into primary data collection centers and make the data analytical-ready. Knowledge and expertise will be needed in integrating, filtering and extracting multivariate and multimodal clinical databas...

Promoted
INSPYR Solutions
Deerfield Beach, Florida

Sales Operations Business Intelligence Analyst (PowerBI, SQL). ...

Promoted
Cognizant Technology Solutions
Tampa, Florida
Remote

Certified Information Systems Security Professional (CISSP) Google Professional Cloud DevOps Engineer Microsoft Certified: Azure DevOps Engineer Expert AWS Certified DevOps Engineer. Implement AWS DevOps practices to enhance cloud infrastructure and application performance. ...

Promoted
VirtualVocations
Lakeland, Florida

A company is looking for a Senior Oracle PL/SQL Developer. ...

Promoted
D Aceto Services LLC
Miami, Florida

D Aceto Services LLC is seeking a motivated and detail-oriented Entry-Level Data Analyst to join our team. Help maintain data integrity and accuracy within databases. In this remote position, you will work closely with various departments to analyze data, generate insights, and support decision-maki...