Talent.com
Machine Learning Data Engineer - Systems & Retrieval
Machine Learning Data Engineer - Systems & RetrievalZyphra Technologies Inc. • Palo Alto, CA, United States
Machine Learning Data Engineer - Systems & Retrieval

Machine Learning Data Engineer - Systems & Retrieval

Zyphra Technologies Inc. • Palo Alto, CA, United States
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Zyphra is an artificial intelligence company based in Palo Alto, California.

The Role :

As a Machine Learning Data Engineer - Systems & Retrieval , you will build and optimize the data infrastructure that fuels our machine learning systems. This includes designing high-performance pipelines for collecting, transforming, indexing, and serving massive, heterogeneous datasets from raw web-scale data to enterprise document corpora. You’ll play a central role in architecting retrieval systems for LLMs and enabling scalable training and inference with clean, accessible, and secure data. You’ll have an impact across both research and product teams by shaping the foundation upon which intelligent systems are trained, retrieved, and reasoned over.

You’ll work across :

Design and implementation of distributed data ingestion and transformation pipelines

Building retrieval and indexing systems that support RAG and other LLM-based methods

Mining and organizing large unstructured datasets, both in research and production environments

Collaborating with ML engineers, systems engineers, and DevOps to scale pipelines and observability

Ensuring compliance and access control in data handling, with security and auditability in mind

Requirements :

Strong software engineering background with fluency in Python

Experience designing, building, and maintaining data pipelines in production environments

Deep understanding of data structures, storage formats, and distributed data systems

Familiarity with indexing and retrieval techniques for large-scale document corpora

Understanding of database systems (SQL and NoSQL), their internals, and performance characteristics

Strong attention to security, access controls, and compliance best practices (e.g., GDPR, SOC2)

Excellent debugging, observability, and logging practices to support reliability at scale

Strong communication skills and experience collaborating across ML, infra, and product teams

Bonus Skill Set :

Experience building or maintaining LLM-integrated retrieval systems (e.g, RAG pipelines)

Academic or industry background in data mining, search, recommendation systems, or IR literature

Experience with large-scale ETL systems and tools like Apache Beam, Spark, or similar

Familiarity with vector databases (e.g., FAISS, Weaviate, Pinecone) and embedding-based retrieval

Understanding of data validation and quality assurance in machine learning workflows

Experience working on cross-functional infra and MLOps teams

Knowledge of how data infrastructure supports training pipelines, inference serving, and feedback loops

Comfort working across raw, unstructured data, structured databases, and model-ready formats

Why Work at Zyphra :

Our research methodology is to make grounded, methodical steps toward ambitious goals. Both deep research and engineering excellence are equally valued

We strongly value new and crazy ideas and are very willing to bet big on new ideas

We move as quickly as we can; we aim to minimize the bar to impact as low as possible

We all enjoy what we do and love discussing AI

Benefits and Perks :

Comprehensive medical, dental, vision, and FSA plans

Competitive compensation and 401(k)

Relocation and immigration support on a case-by-case basis

On-site meals prepared by a dedicated culinary team; Thursday Happy Hours

In-person team in Palo Alto, CA, with a collaborative, high-energy environment

If you're excited by the challenge of high-scale, high-performance data engineering in the context of cutting-edge AI, you’ll thrive in this role. Apply Today!

#J-18808-Ljbffr

serp_jobs.job_alerts.create_a_job

Machine Learning Engineer • Palo Alto, CA, United States

Job_description.internal_linking.related_jobs
Machine Learning Engineer, Recommendation

Machine Learning Engineer, Recommendation

NewsBreak • Mountain View, CA, US
serp_jobs.job_card.full_time
Founded in 2015, NewsBreak is the Content Intelligence platform shaping the future content economy.With over 40 million monthly active users, our flagship platform delivers highly personalized loca...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Machine Learning Engineer, Distributed Systems, Optimus

Machine Learning Engineer, Distributed Systems, Optimus

Tesla • Palo Alto, CA, United States
serp_jobs.job_card.full_time
Machine Learning Engineer, Distributed Systems, Optimus.Machine Learning Engineer, Distributed Systems, Optimus.Machine Learning Engineer, Distributed Systems, Optimus. Machine Learning Engineer, Di...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Machine Learning Engineer, GenAI Applied ML

Machine Learning Engineer, GenAI Applied ML

Scale AI, Inc. • San Francisco, CA, United States
serp_jobs.job_card.full_time
At Scale AI, our mission is to accelerate the development of AI applications.For 8 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including : g...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Machine Learning Engineer

Machine Learning Engineer

Cisco Systems, Inc. • San Jose, CA, United States
serp_jobs.job_card.full_time
Applications are accepted until further notice.The Cisco's AI Research team consists of AI research scientists, data scientists, and network engineers with subject matter expertise who collaborate ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Machine Learning Systems Platform Engineer

Machine Learning Systems Platform Engineer

Blue Signal • San Francisco, CA, United States
serp_jobs.job_card.full_time
Confidential Opening : Machine Learning Systems Platform Engineer.San Francisco, CA (Hybrid Preferred).A stealth-mode innovator at the forefront of AI infrastructure is seeking a dynamic Machine Lea...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Machine Learning Engineer, Training Infrastructure

Machine Learning Engineer, Training Infrastructure

ZipRecruiter • San Francisco, CA, United States
serp_jobs.job_card.full_time
Machine Learning Engineer, Training Infrastructure.We are looking for an ML Engineer with 3+ years of experience in high-performance computing systems to manage and optimize our computational infra...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Machine Learning Data Engineer - Systems & Retrieval

Machine Learning Data Engineer - Systems & Retrieval

Zyphra • Palo Alto, CA, US
serp_jobs.job_card.full_time
Machine Learning Data Engineer - Systems & Retrieval.This includes designing high-performance pipelines for collecting, transforming, indexing, and serving massive, heterogeneous datasets from ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Machine Learning Systems Engineer, RL Engineering

Machine Learning Systems Engineer, RL Engineering

Menlo Ventures • San Francisco, CA, United States
serp_jobs.job_card.full_time
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Machine Learning Systems Engineer, Encodings and Tokenization

Machine Learning Systems Engineer, Encodings and Tokenization

Anthropic • San Francisco, CA, United States
serp_jobs.job_card.full_time
Machine Learning Systems Engineer, Encodings and Tokenization.Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users a...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Machine Learning Engineer, Synthetic Data

Machine Learning Engineer, Synthetic Data

Australian Competition and Consumer Commission • Mountain View, CA, United States
serp_jobs.job_card.full_time
Software Autonomy Sensing Mountain View, California.Aurora’s mission is to deliver the benefits of self-driving technology safely, quickly, and broadly. The Aurora Driver will create a new era in mo...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Machine Learning Research Engineer, Enterprise GenAI

Machine Learning Research Engineer, Enterprise GenAI

Scale AI, Inc. • San Francisco, CA, United States
serp_jobs.job_card.full_time
AI is becoming vitally important in every function of our society.At Scale, our mission is to accelerate the development of AI applications. For 8 years, Scale has been the leading AI data foundry, ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
ML Research Engineer, ML Systems

ML Research Engineer, ML Systems

Scale AI, Inc. • San Francisco, CA, United States
serp_jobs.job_card.full_time
Scale's ML platform (RLXF) team builds our internal distributed framework for large language model training and inference. The platform has been powering MLEs, researchers, data scientists and opera...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Applied AI / ML Engineer

Applied AI / ML Engineer

Catalyst Labs • Menlo Park, CA, US
serp_jobs.job_card.full_time
Catalyst Labs is a leading talent agency with a specialized vertical in Applied AI, Machine Learning, and Data Science.We stand out as an agency thats deeply embedded in our clients recruitment ope...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
Machine Learning Systems Engineer, RL Engineering

Machine Learning Systems Engineer, RL Engineering

Anthropic • San Francisco, CA, United States
serp_jobs.job_card.full_time
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Senior Machine Learning Engineer

Senior Machine Learning Engineer

Meltwater • Redwood City, CA, United States
serp_jobs.job_card.full_time
Meltwater's Consumer Intelligence AI Team is looking for a.Natural Language Processing or Computer Vision features relying on the literature's state of the art. Those features are meant to be integr...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
AI Applications and Data Science Engineer

AI Applications and Data Science Engineer

CXApp US, Inc. • San Ramon, CA, US
serp_jobs.job_card.full_time
CXAPP is a forward-thinking technology company that leverages AI and data science to drive innovation and deliver cutting-edge solutions. We are seeking talented AI Applications and Data Science Eng...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Machine Learning Engineer, Data

Machine Learning Engineer, Data

IntelliPro Group Inc. • San Francisco, CA, US
serp_jobs.job_card.full_time
Machine Learning Engineer, Data.We are looking for an ML Engineer with 3+ YOE designing, building, and maintaining data pipelines at scale. The ideal candidate has diverse experi...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Research Engineer - Machine Learning & Systems

Research Engineer - Machine Learning & Systems

World Labs • San Francisco, CA, United States
serp_jobs.job_card.full_time
We are looking for a versatile Research Engineer with a strong background in machine learning or 3D, software development, and systems design. This role is ideal for someone excited about bridging c...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted