Talent.com
Data Engineer
Data EngineerPeople Data Labs • San Francisco, CA, US
Data Engineer

Data Engineer

People Data Labs • San Francisco, CA, US
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.permanent
job_description.job_card.job_description

Job Description

Job Description

Note for all engineering roles : with the rise of fake applicants and AI-enabled candidate fraud, we have built in additional measures throughout the process to identify such candidates and remove them.

About Us

People Data Labs (PDL) is the provider of people and company data. We do the heavy lifting of data collection and standardization so our customers can focus on building and scaling innovative, compliant data solutions. Our sole focus is on building the best data available by integrating thousands of compliantly sourced datasets into a single, developer-friendly source of truth. Leading companies across the world use PDL's workforce data to enrich recruiting platforms, power AI models, create custom audiences, and more.

We are looking for individuals who can balance extreme ownership with a "one-team, one-dream" mindset. Our customers are trying to solve complex problems, and we only help them achieve their goals as a team. Our Data Engineering Team is the secret sauce behind all that we do and we are looking for the best of the best.

If you are looking to be part of a team discovering the next frontier of data-as-a-service (DaaS) with a high level of autonomy and opportunity for direct contributions, this might be the role for you. We like our engineers to be thoughtful, quirky, and willing to fearlessly try new things. Failure is embraced at PDL as long as we continue to learn and grow from it.

What You Get to Do

  • Build infrastructure for ingestion, transformation, and loading an exponentially increasing volume of data from a variety of sources using Spark, SQL, AWS, and Databricks
  • Building an organic entity resolution framework capable of correctly merging hundreds of billions of individual entities into a number of clean, consumable datasets.
  • Developing CI / CD pipelines and anomaly detection systems capable of continuously improving the quality of data we're pushing into production.
  • Dreaming up solutions to largely undefined data engineering and data science problems.

The Technical Chops You'll Need

  • 4-6+ years of industry experience with clear examples of strategic technical problem-solving and implementation
  • Strong software development fundamentals.
  • Experience with Python
  • Expertise with Apache Spark (Java, Scala, and / or Python-based)
  • Experience with SQL
  • Experience building scalable data processing systems (e.g., cleaning, transformation) from the ground up.
  • Experience using developer-oriented data pipeline and workflow orchestration (e.g., Airflow (preferred), dbt, dagster or similar)
  • Knowledge of modern data design and storage patterns (e.g., incremental updating, partitioning and segmentation, rebuilds and backfills)
  • Experience working in Databricks (including delta live tables, data lakehouse patterns, etc.)
  • Experience with cloud computing services (AWS (preferred), GCP, Azure or similar)
  • Experience with data warehousing (e.g., Databricks, Snowflake, Redshift, BigQuery, or similar)
  • Understanding of modern data storage formats and tools (e.g., parquet, ORC, Avro, Delta Lake)
  • People Thrive Here Who Can

  • Balance high ownership and autonomy with a strong ability to collaborate
  • Work effectively remotely (able to be proactive about managing blockers, proactive on reaching out and asking questions, and participating in team activities)
  • Demonstrate strong written communication skills on Slack / Chat and in documents
  • Exhibt experience in writing data design docs (pipeline design, dataflow, schema design)
  • Scope and breakdown projects, communicate and collaborate progress and blockers effectively with your manager, team, and stakeholders
  • Some Nice To Haves

  • Degree in a quantitative discipline such as computer science, mathematics, statistics, or engineering
  • Experience working with entity data (entity resolution / record linkage)
  • Experience working with data acquisition / data integration
  • Expertise with Python and the Python data stack (e.g., numpy, pandas)
  • Experience with streaming platforms (e.g., Kafka)
  • Experience evaluating data quality and maintaining consistently high data standards across new feature releases (e.g., consistency, accuracy, validity, completeness)
  • Our Benefits

  • Stock
  • Competitive Salaries
  • Unlimited paid time off
  • Medical, dental, & vision insurance
  • Health, fitness, and office stipends
  • The permanent ability to work wherever and however you want
  • Comp : $160-180K

    People Data Labs does not discriminate on the basis of race, sex, color, religion, age, national origin, marital status, disability, veteran status, genetic information, sexual orientation, gender identity or any other reason prohibited by law in provision of employment opportunities and benefits.

    Qualified Applicants with arrest or conviction records will be considered for Employment in accordance with the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act.

    Personal Privacy Policy for California Residents

    https : / / www.peopledatalabs.com / pdf / privacy -policy-and-notice.pdf

    serp_jobs.job_alerts.create_a_job

    Data Engineer • San Francisco, CA, US

    Job_description.internal_linking.related_jobs
    Data Engineer - Multimodal Systems

    Data Engineer - Multimodal Systems

    Zyphra • Palo Alto, CA, US
    serp_jobs.job_card.full_time
    Data Engineer - Multimodal Systems.Zyphra’s datasets and data pipelines across a variety of modalities.Your work will intersect with almost every team at Zyphra. You will be involved in collec...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Data Engineer

    Data Engineer

    PopHealth Learning Center • Oakland, CA, US
    serp_jobs.job_card.full_time
    The PopHealth Learning Center (“PHLC”) is a California Social Purpose Corporation (SPC) committed to transforming how health care is delivered and experienced across California’s ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    AI Incubator - Data Engineer

    AI Incubator - Data Engineer

    Sprinter Health • Menlo Park, CA, US
    serp_jobs.job_card.full_time
    At Sprinter Health, our mission is reimagining how people access care by bringing it directly to their homes.Nearly 30% of patients in the U. For many, the ER becomes their first touchpoint with the...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_1_day • serp_jobs.job_card.promoted
    Staff Data Engineer

    Staff Data Engineer

    Prosper • San Francisco, CA, US
    serp_jobs.job_card.full_time
    We’re hiring a Staff Data Engineer with a solid software engineering background.You’re strong in Python and comfortable with SQL in a DevOps environment.You’ll design, build, and ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Senior Data Engineer

    Senior Data Engineer

    AngelList • San Francisco, CA, US
    serp_jobs.job_card.full_time
    We exist to accelerate innovation.We do this by giving more people the opportunity to participate in the venture economy by building the financial infrastructure that makes it possible for more peo...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_1_day • serp_jobs.job_card.promoted
    Data Engineer

    Data Engineer

    Forhyre • Sunnyvale, CA, US
    serp_jobs.job_card.full_time
    We are looking for a passionate certified Data Engineer.The successful candidate will turn data into information, information into insight and insight into business decisions.Data analyst responsib...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Data Engineer

    Data Engineer

    Contact Government Services, LLC • San Francisco, CA, US
    serp_jobs.job_card.full_time
    Employment Type : Full-Time, Mid-level.Department : Business Intelligence.CGS is seeking a passionate and driven Data Engineer to support a rapidly growing Data Analytics and Business Intelligence pl...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_1_day • serp_jobs.job_card.promoted
    Data Engineer

    Data Engineer

    SteerBridge • Miramar, CA, US
    serp_jobs.job_card.full_time
    SteerBridge Strategies is a CVE-Verified Service-Disabled, Veteran-Owned Small Business (SDVOSB) delivering a broad spectrum of professional services to the U. Backed by decades of hands-on experien...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_1_day • serp_jobs.job_card.promoted
    Data Engineer 4

    Data Engineer 4

    Cypress HCM • San Jose, CA, US
    serp_jobs.job_card.full_time
    Location : San Jose CA 95110 (Hybrid).Duration : 10 / 01 / 2025 to 2 / 27 / 2026.Design, develop, and maintain scalable and reliable data pipelines to support large-scale data processing.Build and optimize d...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Data Engineer

    Senior Data Engineer

    Plum Inc • San Francisco, CA, US
    serp_jobs.job_card.full_time
    PLUM is a fintech company empowering financial institutions to grow their business through a cutting-edge suite of AI-driven software, purpose-built for lenders and their partners across the financ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Platform Data Engineer

    Platform Data Engineer

    Ohalo • San Francisco, CA, US
    serp_jobs.job_card.full_time
    Ohalo is seeking an experienced.This role involves building and maintaining data pipelines to support our machine learning engineering activities and managing data related to plant phenotypes and g...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Data Engineer

    Data Engineer

    The Rockridge Group • Emeryville, CA, US
    serp_jobs.job_card.full_time
    Google Search console experience required.Google Tag Manager, merchant account or data studio experience preferred.Facebook knowledge will be big plus. Very proficient with installation, data interr...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Data Analytics Engineer

    Data Analytics Engineer

    Parafin • San Francisco, CA, US
    serp_jobs.job_card.full_time
    At Parafin, we’re on a mission to grow small businesses.Small businesses are the backbone of our economy, but traditional banks often don’t have their backs. We build tech that makes it ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Data Engineer

    Senior Data Engineer

    Toyota Research Institute • Los Altos, CA, US
    serp_jobs.job_card.full_time
    At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life.We’re developing new tools and capabilities to amplify the human experience.To lead this tran...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_1_day • serp_jobs.job_card.promoted
    Data Engineer

    Data Engineer

    Cypress HCM • Mountain View, CA, US
    serp_jobs.job_card.full_time
    The Cloud services group is looking for world class engineers to join our technology innovation group focused on the rapid development of cloud based end-to-end mobile applications and services.Thi...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Data Engineer 4

    Data Engineer 4

    NextDeavor Inc. • San Jose, CA, US
    serp_jobs.job_card.temporary
    Here's how you'll become a key player with this opportunity : .We are looking for a Splunk expert to join on a short-term contract and help stabilize, optimize, and improve our Splunk environ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Data Engineer

    Data Engineer

    Institute of Foundation Models • Sunnyvale, CA, US
    serp_jobs.job_card.full_time
    About the Institute of Foundation Models.We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_1_day • serp_jobs.job_card.promoted
    Data Engineer - Product

    Data Engineer - Product

    xAI • Palo Alto, CA, US
    serp_jobs.job_card.full_time
    AI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering exc...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted