Talent.com
Principal Data Engineer

Principal Data Engineer

StartXPalo Alto, CA, United States
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Overview

Sanas is revolutionizing the way we communicate with the world’s first real-time algorithm, designed to modulate accents, eliminate background noises, and magnify speech clarity. Pioneered by seasoned startup founders with a proven track record of creating and steering multiple unicorn companies, our groundbreaking GDP-shifting technology sets a gold standard.

Sanas is a 200-strong team, established in 2020. In this short span, we’ve successfully secured over $100 million in funding. Our innovation has been supported by the industry’s leading investors, including Insight Partners, Google Ventures, Quadrille Capital, General Catalyst, Quiet Capital, and other influential investors. Our reputation is further solidified by collaborations with numerous Fortune 100 companies. With Sanas, you’re not just adopting a product; you’re investing in the future of communication.

Role

We’re looking for an experienced and forward-thinking Principal Data Engineer to lead the design and implementation of our end-to-end data infrastructure for industry-leading Voice AI products. This is a high-impact role where you will shape the technical vision, own strategic architecture decisions, and mentor a growing team of Data engineers focused on delivering reliable and scalable data systems for Machine Learning at scale.

You’ll work cross-functionally with AI research scientists, infrastructure and product teams to ensure that data—from raw audio to training-ready features—is consistently accessible, compliant and optimized for speed and scale. You’ll help push the boundaries of real-time Voice AI!

Key Responsibilities

  • Architect and lead the development of large-scale data pipelines and data lakes to ingest, transform and serve high-quality data for AI model training, product telemetry and analytics.
  • Drive long-term data infrastructure strategy across streaming and batch, feature store extensions, Iceberg / Delta lake choices, metadata management, and lakehouse evolution.
  • Drive platform and infrastructure decisions, optimizing compute fleets (e.g., Ray, Spark clusters), orchestration tooling (Airflow, Dagster), and streaming stacks (Kafka, Flink).
  • Collaborate with AI research scientists, engineering leads, product, finance, marketing, and legal to align data architecture with business and regulatory requirements.
  • Advocate best practices in data governance, lineage, observability, testing, tooling, and disaster recovery across pipelines and data stores.
  • Act as a mentor and technical leader—review design and code, share patterns, elevate team capability, and support recruitment and hiring.
  • Drive build vs buy decisions for tools to implement data quality and observability solutions to achieve high data quality.

Qualifications

  • 10+ years of experience in Data Engineering, Infrastructure, or ML Systems, with at least 2+ years in a technical leadership capacity.
  • Expertise in building distributed batch and real-time data systems.
  • Expertise in databases (like Postgres) and data lakes (like Snowflake, Databricks and ClickHouse).
  • Experience using data processing frameworks like Spark, Flink and Ray.
  • Deep experience with cloud platforms AWS / GCP, object storage (e.g., S3), and orchestration tools like Airflow and Dagster.
  • Strong knowledge of data lifecycle management, including privacy, security, compliance and reproducibility.
  • Comfortable working in a fast-paced startup environment.
  • Strategic mindset and proven ability to collaborate across engineering, ML and product teams to deliver infrastructure that scales with the business.
  • Nice to Have

  • Familiarity with audio data and its unique challenges, like large file sizes and time-series features; metadata handling is a strong plus.
  • Experience with Voice AI models like ASR, TTS and speaker verification.
  • Familiarity with real-time data processing frameworks like Kafka, Flink, Druid and Pinot.
  • Familiarity with ML workflows including MLOps, feature engineering, model training and inference.
  • Experience with labeling tools, audio annotation platforms, or human-in-the-loop annotation pipelines.
  • Joining us means contributing to the world’s first real-time speech understanding platform revolutionizing Contact Centers and Enterprises alike.

    Our technology empowers agents, transforms customer experiences, and drives measurable growth. But this is just the beginning. You'll be part of a team exploring the vast potential of an increasingly sonic future.

    #J-18808-Ljbffr

    serp_jobs.job_alerts.create_a_job

    Principal Data Engineer • Palo Alto, CA, United States

    Job_description.internal_linking.related_jobs
    • serp_jobs.job_card.promoted
    Principal Data Engineer - Databricks, Data Platforms

    Principal Data Engineer - Databricks, Data Platforms

    Internetwork ExpertSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Simple Machines ANZ – Job Ad – Principal Data Engineer.Position : Principal Data Engineer.Location : Darlinghurst, Sydney. Simple Machines is a leading independent boutique technology firm with a glob...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Principal Enterprise Application Engineer

    Principal Enterprise Application Engineer

    FyrFly Venture PartnersPleasanton, CA, United States
    serp_jobs.job_card.full_time
    Principal Enterprise Application Engineer.The Principal Enterprise Application Engineer specializes in D365 Finance and Operations (F&O), focusing on designing, developing, and maintaining complex ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Principal Sales and Solutions Engineer

    Principal Sales and Solutions Engineer

    UdemySan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Udemy is an AI-powered skills acceleration platform built to help people and teams grow.It's personalized, practical, and focused on real-world impact. Our mission is simple : to transform lives thro...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    Data Engineer

    Data Engineer

    CGSSan Francisco, California, United States, 94102
    serp_jobs.job_card.full_time
    Employment Type : Full-Time, Mid-level.Department : Business Intelligence.CGS is seeking a passionate and driven Data Engineer to support a rapidly growing Data Analytics and Business Intelligence pl...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Principal Machine Learning Engineer

    Principal Machine Learning Engineer

    Tubi TvSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Boldly built for every fandom, Tubi is a free streaming service that entertains over 100 million monthly active users.Tubi offers the world's largest collection of Hollywood movies and TV shows, th...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Associate Principal Engineer - AI

    Associate Principal Engineer - AI

    ExelixisAlameda, CA, United States
    serp_jobs.job_card.full_time
    As a Senior AI Architect, you will play a pivotal role in developing and implementing our enterprise AI strategy.You will be responsible for designing and delivering sophisticated AI solutions, hig...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Principal Solutions Engineer

    Principal Solutions Engineer

    AtlassianSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Atlassian offers flexibility in where they work – office, home, or a combination.Interviews and onboarding are conducted virtually, as part of being a distributed-first company.We can hire people i...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Principal Data Engineer

    Principal Data Engineer

    SanasPalo Alto, CA, United States
    serp_jobs.job_card.full_time
    Sanas is revolutionizing the way we communicate with the world’s first real-time algorithm, designed to modulate accents, eliminate background noises, and magnify speech clarity.Our GDP-shifting te...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Principal Platform Engineer (DLP)

    Principal Platform Engineer (DLP)

    Palo Alto NetworksSanta Clara, CA, US
    serp_jobs.job_card.full_time
    At Palo Alto Networks® everything starts and ends with our mission : .Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer a...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    Principal Data Infrastructure Engineer

    Principal Data Infrastructure Engineer

    fabric IncSan Francisco, California, United States, 94102
    serp_jobs.job_card.full_time
    We're a team of dedicated experts creating a new way to commerce for the age of AI Shopping.AI Commerce Operating System to orchestrate, optimize, and scale unified commerce for everyone.It's a sys...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Principal, DAA Programs (Data, Analytics, and AI)

    Principal, DAA Programs (Data, Analytics, and AI)

    SnowflakeMenlo Park, CA, US
    serp_jobs.job_card.full_time
    We're looking for a sharp, strategic operator to join our team as Principal, DAA Programsa critical role at the heart of Snowflake's data, analytics, and AI transformation.Reporting directly to the...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Sr. Software Engineer - Data Engineer

    Sr. Software Engineer - Data Engineer

    FyrFly Venture PartnersPleasanton, CA, United States
    serp_jobs.job_card.full_time
    Software Engineer - Data Engineer.Senior Software Engineer – Data Engineer.This role will focus on building and optimizing data pipelines, integrating multiple data sources, and enabling advanced a...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Principal Solutions Engineer

    Principal Solutions Engineer

    FreshworksSan Mateo, CA, US
    serp_jobs.job_card.full_time
    Organizations everywhere struggle under the crushing costs and complexities of “solutions” that promise to simplify their lives. To create a better experience for their customers and emp...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Senior / Principal Software Engineer – Data Pipelines & Performance

    Senior / Principal Software Engineer – Data Pipelines & Performance

    xage, incPalo Alto, CA, United States
    serp_jobs.job_card.full_time
    Senior / Principal Software Engineer – Data Pipelines & Performance.Senior / Principal Software Engineer – Data Pipelines & Performance. Xage is the first and only zero trust real-world security com...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Principal Machine Learning Engineer

    Principal Machine Learning Engineer

    Black OreSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Black Ore is building the leading AI platform for financial services.By combining LLMs, proprietary AI / ML and automation we accelerate core workflows for the industry, allow financial services prof...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Principal Data Infrastructure Engineer

    Principal Data Infrastructure Engineer

    fabricSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Principal Data Infrastructure Engineer.We’re a team of dedicated experts creating a new way to commerce for the age of AI Shopping. AI Commerce Operating System to orchestrate, optimize, and scale u...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Senior Data Platform Engineer

    Senior Data Platform Engineer

    Ellipsis HealthSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Ellipsis Health is creating cutting-edge AI / ML products that solve healthcare staffing issues and administrative burdens using conversational AI and our patented voice biomarker technology in the d...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Principal Sales and Solutions Engineer

    Principal Sales and Solutions Engineer

    BEDI PartnershipsSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Udemy is an AI-powered skills acceleration platform built to help people and teams grow.It’s personalized, practical, and focused on real-world impact. Our mission is to transform lives through lear...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days