Talent.com
Data Engineer (Founding Team)
Fabrion • Bodega Bay, CA, US
Posted 30 days ago
Job type: Full-time

Job Description

Data / ETL Engineer (Founding Team)

Location: San Francisco Bay Area

Type: Full-Time

Compensation: Competitive salary + early-stage equity

Backed by 8VC, we're building a world-class team to tackle one of the industry’s most critical infrastructure problems.

About the Role

We’re building a multi-tenant, AI-native platform where enterprise data becomes actionable through semantic enrichment, intelligent agents, and governed interoperability. At the heart of this architecture lies our Data Fabric — an intelligent, governed layer that turns fragmented and siloed data into a connected ontology ready for model training, vector search, and insight-to-action workflows.

We're looking for engineers who enjoy hard data problems at scale: messy unstructured data, schema drift, multi-source joins, security models, and AI-ready semantic enrichment. You’ll build the backend systems, data pipelines, connector frameworks, and graph-based knowledge models that fuel agentic applications.

If you've worked on streaming unstructured pipelines, built connectors into ugly legacy systems, or mapped knowledge graphs that scale — this role will feel like home.

Responsibilities

Build highly reliable, scalable data ingestion and transformation pipelines across structured, semi-structured, and unstructured data sources

Develop and maintain a connector framework for ingesting from enterprise systems (ERPs, PLMs, CRMs, legacy data stores, email, Excel, docs, etc.)

Design and maintain the data fabric layer — including a knowledge graph (Neo4j or Puppygraph) enriched with ontologies, metadata, and relationships

Normalize and vectorize data for downstream AI / LLM workflows — enabling retrieval-augmented generation (RAG), summarization, and alerting

Create and manage data contracts, access layers, lineage, and governance mechanisms

Build and expose secure APIs for downstream services, agents, and users to query enriched semantic data

Collaborate with ML / LLM teams to feed high-quality enterprise data into model training and tuning pipelines

What We’re Looking For

Core Experience:

5+ years building large-scale data infrastructure in production environments

Deep experience with ingestion frameworks (Kafka, Airbyte, Meltano, Fivetran) and data pipeline orchestration (Airflow, Dagster, Prefect)

Comfortable processing unstructured data formats: PDFs, Excel, emails, logs, CSVs, web APIs

Experience working with columnar stores, object storage, and lakehouse formats (Iceberg, Delta, Parquet)

Strong background in knowledge graphs or semantic modeling (e.g. Neo4j, RDF, Gremlin, Puppygraph)

Familiarity with GraphQL, RESTful APIs, and designing developer-friendly data access layers

Experience implementing data governance: RBAC, ABAC, data contracts, lineage, data quality checks

Mindset & Culture Fit:

You’re a systems thinker: you want to model the real world, not just process it

Comfortable navigating ambiguous data models and building from scratch

Passionate about enabling AI systems with real-world, messy enterprise data

Pragmatic about scalability, observability, and schema evolution

Value autonomy, high trust, and meaningful ownership over infrastructure

Bonus Skills

Prior work with vector DBs (e.g. Weaviate, Qdrant, Pinecone) and embedding pipelines

Experience building or contributing to enterprise connector ecosystems

Knowledge of ontology versioning, graph diffing, or semantic schema alignment

Familiarity with data fabric patterns (e.g. Palantir Ontology, Linked Data, W3C standards)

Familiar with fine-tuning LLMs or enabling RAG pipelines using enterprise knowledge

Experience enforcing data access policy with tools like OPA, Keycloak, or Snowflake row-level security

Why This Role Matters

Agents are only as smart as the data they operate on. This role builds the foundation — the semantic, governed, connected substrate — that makes autonomous decision-making and agent action possible. From factory ERP records to geopolitical news alerts, the data fabric unifies it all.

If you're excited to tame complexity, unify chaos, and power intelligent systems with trusted data — we’d love to hear from you.
