Talent.com
Principal Data Engineer

Principal Data Engineer

SanasPalo Alto, CA, United States
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Overview

Sanas is revolutionizing the way we communicate with the world’s first real-time algorithm, designed to modulate accents, eliminate background noises, and magnify speech clarity. Our GDP-shifting technology sets a gold standard and is backed by seasoned founders with a track record of guiding unicorns.

Sanas is a 200-strong team, established in 2020. We have secured over $100 million in funding and partner with leading investors, including Insight Partners, Google Ventures, Quadrille Capital, General Catalyst, and Quiet Capital. We collaborate with Fortune 100 companies and are shaping the future of communication.

We’re looking for an experienced and forward-thinking Principal Data Engineer to lead the design and implementation of our end-to-end data infrastructure for industry-leading Voice AI products. This high-impact role shapes the technical vision, owns strategic architecture decisions, and mentors a growing team of Data engineers focused on delivering reliable and scalable data systems for Machine Learning at scale.

You’ll work cross-functionally with AI research scientists, infrastructure, and product teams to ensure that data—from raw audio to training-ready features—is consistently accessible, compliant, and optimized for speed and scale. You’ll help push the boundaries of real-time Voice AI!

Key Responsibilities

  • Architect and lead the development of large-scale data pipelines and data lakes to ingest, transform and serve high-quality data for AI model training, product telemetry, and analytics.
  • Drive long-term data infrastructure strategy across streaming and batch, feature store extensions, Iceberg / Delta lake choices, metadata management, and lakehouse evolution.
  • Drive platform and infrastructure decisions, optimizing compute fleets (e.g., Ray, Spark clusters), orchestration tooling (Airflow, Dagster), and streaming stacks (Kafka, Flink).
  • Collaborate with AI research scientists, engineering leads, product, finance, marketing, and legal to align data architecture with business and regulatory requirements.
  • Advocate best practices in data governance, lineage, observability, testing, tooling, and disaster recovery across pipelines and data stores.
  • Act as a mentor and technical leader—review design and code, share patterns, elevate team capability, and support recruitment and hiring.
  • Drive build vs buy decisions for tools to implement data quality and observability solutions to achieve high data quality.

Qualifications

  • 10+ years of experience in Data Engineering, Infrastructure, or ML Systems, with at least 2+ years in a technical leadership capacity.
  • Expertise in building distributed batch and real-time data systems.
  • Expertise in databases (like PostgreSQL) and data lakes (like Snowflake, Databricks, and ClickHouse).
  • Experience using data processing frameworks like Spark, Flink, and Ray.
  • Deep experience with cloud platforms (AWS / GCP), object storage (e.g., S3), and orchestrators like Airflow and Dagster.
  • Strong knowledge of data lifecycle management, including privacy, security, compliance, and reproducibility.
  • Comfortable working in a fast-paced startup environment.
  • Strategic mindset and proven ability to collaborate across engineering, ML, and product teams to deliver scalable infrastructure.
  • Nice to Have

  • Familiarity with audio data and its challenges, such as large file sizes and time-series features, metadata handling.
  • Experience with Voice AI models like ASR, TTS, and speaker verification.
  • Familiarity with real-time data processing frameworks like Kafka, Flink, Druid, and Pinot.
  • Familiarity with ML workflows including MLOps, feature engineering, model training, and inference.
  • Experience with labeling tools, audio annotation platforms, or human-in-the-loop annotation pipelines.
  • Joining us means contributing to the world’s first real-time speech understanding platform revolutionizing Contact Centers and Enterprises alike.

    Our technology empowers agents, transforms customer experiences, and drives measurable growth. You’ll be part of a team exploring the vast potential of an increasingly sonic future.

    #J-18808-Ljbffr

    serp_jobs.job_alerts.create_a_job

    Principal Data Engineer • Palo Alto, CA, United States

    Job_description.internal_linking.related_jobs
    • serp_jobs.job_card.promoted
    Principal Data Engineer

    Principal Data Engineer

    StartXPalo Alto, CA, United States
    serp_jobs.job_card.full_time
    Sanas is revolutionizing the way we communicate with the world’s first real-time algorithm, designed to modulate accents, eliminate background noises, and magnify speech clarity.Pioneered by season...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Principal Data Engineer - Databricks, Data Platforms

    Principal Data Engineer - Databricks, Data Platforms

    Internetwork ExpertSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Simple Machines ANZ – Job Ad – Principal Data Engineer.Position : Principal Data Engineer.Location : Darlinghurst, Sydney. Simple Machines is a leading independent boutique technology firm with a glob...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Principal Enterprise Application Engineer

    Principal Enterprise Application Engineer

    FyrFly Venture PartnersPleasanton, CA, United States
    serp_jobs.job_card.full_time
    Principal Enterprise Application Engineer.The Principal Enterprise Application Engineer specializes in D365 Finance and Operations (F&O), focusing on designing, developing, and maintaining complex ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Principal Data Scientist

    Principal Data Scientist

    I did my part and supported the Regular ToiletSan Jose, CA, United States
    serp_jobs.job_card.full_time
    Changing the world through digital experiences is what Adobe’s all about.We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital exper...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
    • serp_jobs.job_card.promoted
    Principal Sales and Solutions Engineer

    Principal Sales and Solutions Engineer

    UdemySan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Udemy is an AI-powered skills acceleration platform built to help people and teams grow.It's personalized, practical, and focused on real-world impact. Our mission is simple : to transform lives thro...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Principal Data Scientist (10042)

    Principal Data Scientist (10042)

    Extreme Networks, Inc.San Jose, CA, United States
    serp_jobs.job_card.full_time
    Principal Data Scientist – (Gen AI, Machine Learning).Are you energized by the idea of innovating with Generative AI? Do you want to create global impact while tackling challenges at the forefront ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Principal Data Infrastructure Engineer

    Principal Data Infrastructure Engineer

    fabric IncSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    We’re a team of dedicated experts creating a new way to commerce for the age of AI Shopping.AI Commerce Operating System to orchestrate, optimize, and scale unified commerce for everyone.It’s a sys...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Principal Machine Learning Engineer

    Principal Machine Learning Engineer

    Tubi TvSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Boldly built for every fandom, Tubi is a free streaming service that entertains over 100 million monthly active users.Tubi offers the world's largest collection of Hollywood movies and TV shows, th...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Data Engineer Engineering

    Data Engineer Engineering

    Delphi AI Inc.San Francisco, CA, United States
    serp_jobs.job_card.full_time
    At Delphi, we are redefining how knowledge is shared by creating a new medium for human communication : interactive digital minds that people can talk to, learn from, and be guided by.The internet g...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Principal Solutions Engineer

    Principal Solutions Engineer

    AtlassianSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Atlassian offers flexibility in where they work – office, home, or a combination.Interviews and onboarding are conducted virtually, as part of being a distributed-first company.We can hire people i...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Data Engineer, Analytics

    Data Engineer, Analytics

    OpenAISan Francisco, CA, United States
    serp_jobs.job_card.full_time
    The Applied team works across research, engineering, product, and design to bring OpenAI’s technology to consumers and businesses. We seek to learn from deployment and distribute the benefits of AI,...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Principal Data Scientist

    Principal Data Scientist

    NorthbeamSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Northbeam is building the world's most advanced marketing intelligence platform, providing top eCommerce brands a unified view of their business data through powerful attribution modeling and custo...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    AI Systems & Data Engineer

    AI Systems & Data Engineer

    HyperFiSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    We're building the kind of platform we always wanted to use : fast, flexible, and built for making sense of real-world complexity. Behind the scenes is a robust, event-driven architecture that connec...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Principal, DAA Programs (Data, Analytics, and AI)

    Principal, DAA Programs (Data, Analytics, and AI)

    SnowflakeMenlo Park, CA, US
    serp_jobs.job_card.full_time
    We're looking for a sharp, strategic operator to join our team as Principal, DAA Programsa critical role at the heart of Snowflake's data, analytics, and AI transformation.Reporting directly to the...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Senior / Principal Software Engineer – Data Pipelines & Performance

    Senior / Principal Software Engineer – Data Pipelines & Performance

    xage, incPalo Alto, CA, United States
    serp_jobs.job_card.full_time
    Senior / Principal Software Engineer – Data Pipelines & Performance.Senior / Principal Software Engineer – Data Pipelines & Performance. Xage is the first and only zero trust real-world security com...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Principal Data Infrastructure Engineer

    Principal Data Infrastructure Engineer

    fabricSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Principal Data Infrastructure Engineer.We’re a team of dedicated experts creating a new way to commerce for the age of AI Shopping. AI Commerce Operating System to orchestrate, optimize, and scale u...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Senior Data Platform Engineer

    Senior Data Platform Engineer

    Ellipsis HealthSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Ellipsis Health is creating cutting-edge AI / ML products that solve healthcare staffing issues and administrative burdens using conversational AI and our patented voice biomarker technology in the d...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    5026 Principal Engineer R&D

    5026 Principal Engineer R&D

    Toshiba America Electronic Components, IncSan Jose, CA, US
    serp_jobs.job_card.full_time
    serp_jobs.filters_job_card.quick_apply
    Summary The San Jose Storage Design Center (SDC) is part of Toshiba America Electronic Components, Inc.We are responsible for the design and development of hard disk drives (HDDs), solving interest...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30