Talent.com
Software Engineer, Data
Imbue • San Francisco, CA, US
Job type: Full-time

Job Description

About us

Our first product, Sculptor, gives engineers the power to run and coordinate multiple coding agents in parallel, helping them move faster and stay in flow. At its core, Sculptor is about making software creation faster, safer, and more reliable for developers.

More broadly, our mission is to make "open" AI agents win over closed agents. To do that, we’re starting with coding agents, since they make it easier to construct future agents and we can use them ourselves. The insights we gain from these agents will help us improve both the capabilities of the underlying models and the interaction design for agents. Ultimately, we aim to rekindle the dream of the personal computer, where computers become truly intelligent tools that empower us, giving us freedom, dignity, and agency to pursue the things we love.

Summary

We’re a small, cross-functional team focused on building AI systems that reason and code. We care deeply about understanding how people interact with these systems and how we can use data to make them safer, smarter, and more useful.

We're looking for a Data Engineer to build and own the pipelines and data infrastructure that power our product and research efforts. Your work will directly support model training, evaluation, product analytics, and safety systems. You’ll partner closely with team members building our coding agents to make sure we’re capturing the right signals and using them well.

If you’re excited about turning messy product data into actionable insights and building systems that can scale with our research, we’d love to connect!

Example Projects

  • Combine synthetic data generation with human annotation platforms to produce high-quality data that advances our product and research roadmap.
  • Design and build resilient, scalable pipelines (ETL and ELT) for batch and streaming data.
  • Develop and maintain infrastructure to support self-serve analytics, experimentation, and dataset generation. Prototype, evaluate, and make “build vs buy” decisions.
  • Help define and improve data modeling practices across the company, including instrumentation standards, dimensional modeling for analytics, and feature stores for machine learning (ML).
  • Build integrations with ML infrastructure to support training pipelines, inference logging, and model monitoring (MLOps).
  • Debug pipeline failures, automate deployment processes, and improve data quality and reusability.

You are

  • A strong software engineer with 5+ years of experience, ideally working with large-scale data systems.
  • Experienced in designing and maintaining data pipelines and infrastructure, especially for analytics, experimentation, and ML.
  • Comfortable with tools for data orchestration (Airflow, Prefect), batch or streaming processing (Spark, Ray, Flink), and event tracking and analytics (Amplitude, PostHog).
  • Experienced with cloud-based infrastructure and storage (e.g., S3, GCP, Snowflake, or Redshift), and thoughtful about cost-performance tradeoffs.
  • Someone with exposure to MLOps, model serving infrastructure, or ML workflows.
  • Pragmatic and principled! You know when to optimize and when to ship.

Compensation and Benefits

  • Competitive compensation, equity, and benefits
  • Lunch provided daily to onsite employees
  • $250 lifestyle stipend per month
  • Generous budget for self-improvement: coaching, courses, conferences, etc.
  • Actively co-create and participate in a positive, intentional team culture
  • Spend time learning, reading papers, and deeply understanding prior work.
  • Frequent team events, dinners, off-sites, and hanging out.
  • Compensation packages are highly variable based on a variety of factors. If your salary requirements fall outside of the stated range, we still encourage you to apply. The range for this role is $170,000–$350,000 cash, $10,000–$2,000,000 in equity.

How to apply

    All submissions are reviewed by a person, so we encourage you to include notes on why you're interested in working with us. If you have any other work that you can showcase (open source code, side projects, etc.), certainly include it! We know that talent comes from many backgrounds, and we aim to build a team with diverse skillsets that spike strongly in different areas.

    We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
