Talent.com
Staff Data Engineer, Merchandising Catalog & Taxonomy (IC)
Staff Data Engineer, Merchandising Catalog & Taxonomy (IC)Attachments King • San Francisco, California, US
serp_jobs.error_messages.no_longer_accepting
Staff Data Engineer, Merchandising Catalog & Taxonomy (IC)

Staff Data Engineer, Merchandising Catalog & Taxonomy (IC)

Attachments King • San Francisco, California, US
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

About The Role

Attachments King is an eCommerce startup in the Heavy Equipment Industry developing proprietary software that flexibly discovers compatibility between equipment and host machine components.

We're hiring a Staff Data Engineer in an Individual Contributor role (no direct reports) to build and operate the single source of truth for a high‑SKU construction equipment catalog : taxonomy, product ingestion, price / availability pipelines from messy non‑API sources, and the automation layer that scales SKU count, price discovery, and data quality with for a small team with lean headcount.

This role is based in San Francisco, CA. This will be an in-office role and will extend past the standard 40 hours / week of many 9-5 jobs. We have long hours, weekend work sessions, and prioritize a results-driven culture.

Salary, Equity, and Benefits

Base Pay : $245,000 / year

Equity Offered : 2.00% (Options, 1yr Cliff, 4yr vest)

  • No Funding Raised, Most Recent 409A FMV is $10M.

Total Compensation : $295,000 / year

  • TC excludes potential refreshes; equity valued at 409A on grant date, amortized over 4 years
  • Employer-provided Health Insurance

    Employer-provided 401k Plan

    Day‑to‑day scope

  • Taxonomy & PIM modeling : Own category trees, attributes, variants, compatibility metadata, and normalization rules (GS1 / UNSPSC awareness; custom facets for consumer browse paths).
  • Data ingestion (messy source formats) : Build resilient pipelines for CSV / Excel, email attachments, SFTP, scraped HTML, PDFs, and images.
  • Transform & validate : Typed, idempotent ETL / ELT with schema evolution and contract-based QA.
  • Pricing & availability : Schedulers / agents to detect deltas, reconcile conflicts, discover competing listings, and publish to Shopify with guardrails for margin protection.
  • Images : Automation for background removal, resizing, deduping, and attribute extraction (e.g., dimensions, metadata).
  • Analytics : Build merchandising dashboards (assortment growth, price competitiveness, availability, metadata quality).
  • Operations & SRE : Observability, alerting, backfills, SLAs / SLOs, rollback strategies, and cost control.
  • Current Platforms

  • AWS (native-first) : S3, DynamoDB, Neptune, Lambda, Step Functions, ECS / Fargate, EventBridge, SQS / SNS, CloudWatch, SSM Parameter Store.
  • IaC : AWS CDK v2 (Python / TypeScript)
  • ECommerce Platform : Shopify Plus
  • Analytics : Power BI / Microsoft Fabric
  • AI Tooling : Cursor, Devin, Graphite, Personal ChatGPT Pro / Claude Max plans
  • Requests for, and use of, additional AI tools is heavily encouraged

    Core outcomes

    30 days :

  • Ship a production ingestion → normalization → enrichment → publish pipeline for all existing SKUs (2,200); stand up initial PIM data model with faceted attributes optimized for search / browse; wire price & availability watchers for all current vendors (files, web pages, emails, competitor websites).
  • Baseline data quality with automated contracts & tests ; initial operational dashboards (latency, freshness, fill rates, failure rates).
  • 90 days :

  • SKU count increased by 500% (11,000), coverage expanded to support top 100 product families and machine categories rank-ordered by search traffic demand; image set completeness >
  • 95% for top movers; pricing latency < 15 minutes for tracked vendors; vendor onboarding time < 48 hours from first file to live SKUs.

  • AI / agent workflows auto‑extract attributes from PDFs / images; continuous taxonomy evolution with zero-downtime migrations.
  • 365 days :

  • Deliver $9.27M in annual revenue, 100% attributable to zero-touch online orders of managed SKUs.
  • Must‑have requirements

  • 7+ years building production data systems (or commensurate impact) : Python (pandas / polars), SQL (Postgres / Redshift / Snowflake / BigQuery), orchestration (Step Functions / Airflow / Prefect), eventing (SQS / Kafka), object storage (S3), CI / CD, containerization.
  • Ecommerce catalog expertise : PIM concepts (attribute schemas, variants / SKU creation, canonicalization, dedup), Shopify Admin / GraphQL, metafields, collections, feed health.
  • Non‑API data wrangling at scale : Selenium / Playwright for scraping (with robots / legal etiquette, rotation, backoff), email / SFTP ingestion, PDF OCR, document parsing.
  • Data quality & contracts : Great Expectations, Pydantic (typed models), versioned schemas, migration plans, data diffing, idempotency as a base case.
  • Image processing : PIL / Pillow, OpenCV, ImageMagick; batch pipelines and basic color / contrast / compositing.
  • Analytics : Power BI and / or Tableau; metric design for merchandising (coverage, freshness, price index, conversion lift).
  • AI / agentic workflows : Retrieval + tool‑use agents to extract attributes, reconcile conflicts, propose taxonomy changes; prompt chaining; evaluation harnesses; safe‑ops patterns for deterministic fallbacks.
  • Search relevance & indexing : Search relevance for catalogs (Meilisearch / Elastic / OpenSearch) and faceted navigation tuning.
  • AWS : S3, Lambda, Glue / Athena, Step Functions, ECS / Fargate, CloudWatch; IaC via the CDK; strong cost / performance instincts.
  • Nice‑to‑haves

  • Experience with homegrown PIMs
  • Vendor EDI familiarity; GS1 barcoding; UNSPSC mapping.
  • You Might Thrive Here If...

  • You are incredibly ambitious
  • You are a self-starter and intensely curious
  • You are hard-working and relentless, frequently going above and beyond in previous or current roles
  • You are driven by achievement and energized by big, industry-disrupting challenges
  • You want a "hardcore" work environment
  • You want to leave a positive impact on the world
  • About Attachments King

    Attachments King is E-Commerce for Heavy Machinery Attachments. We're pushing the boundaries of the construction industry with innovative proprietary technology that drastically improves the customer experience when purchasing heavy equipment. We firmly prioritize a hard-working, results-driven culture.

    Our bar for talent is high, and we do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status. If you are remarkably good at what you do, you belong on our team.

    For US Based Candidates : Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.

    This is the most important time to be alive in human history. Join us, and be a part of something incredible.

    serp_jobs.job_alerts.create_a_job

    Staff Data Engineer • San Francisco, California, US

    Job_description.internal_linking.related_jobs
    Senior Staff AI Engineer

    Senior Staff AI Engineer

    Palo Alto Networks • Santa Clara, CA, US
    serp_jobs.job_card.full_time
    At Palo Alto Networks® everything starts and ends with our mission : .Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer a...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Staff Software Engineer, Data

    Staff Software Engineer, Data

    Sift Stack, Inc. • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    At Sift, we’re redefining how modern machines are built, tested, and operated.Our platform gives engineers real-time observability over high-frequency telemetry—eliminating bottlenecks and enabling...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Staff Software Engineer - Data

    Staff Software Engineer - Data

    Windfall Data, Inc. • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Windfall is seeking a Staff Software Engineer to join our Data team.As a Staff Engineer on our data team, you will be building out the core data asset that everything else at Windfall is built on t...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Staff / Senior AI Engineer

    Staff / Senior AI Engineer

    Airwallex Pty Ltd. • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Airwallex is the only unified payments and financial platform for global businesses.Powered by our unique combination of proprietary infrastructure and software, we empower over 150,000 businesses ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Staff Software Engineer, AI and Data Technology

    Staff Software Engineer, AI and Data Technology

    Omada Health • South San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Omada Health is on a mission to inspire and engage people in lifelong health, one step at a time.The Staff Software Engineer for AI and Data Technologies will play a critical role in advancing our ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Sr. / Staff Software Engineer, Big Data

    Sr. / Staff Software Engineer, Big Data

    Predactiv • Palo Alto, CA, US
    serp_jobs.job_card.full_time
    ShareThis, a Predactiv Company is a big data company that owns online behavior data of 1b+ users globally.We are developing an audience intelligence platform with cutting edge big data technologies...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Staff Data Engineer- Data Architect

    Staff Data Engineer- Data Architect

    Headspace • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    About the Staff Data Engineer at Headspace.At Headspace, our mission is to transform mental healthcare to improve the health and happiness of the world. We’re looking for an experienced Data Archite...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Staff Software Engineer (Data)

    Staff Software Engineer (Data)

    Amigo • San Francisco, CA, US
    serp_jobs.job_card.full_time
    Amigo builds trust and safety infrastructure for AI in mission-critical environments.We partner with organizations in healthcare and other regulated sectors to deploy AI systems that operate reliab...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    AI Incubator - Data Engineer

    AI Incubator - Data Engineer

    Sprinter Health • Menlo Park, CA, US
    serp_jobs.job_card.full_time
    At Sprinter Health, our mission is reimagining how people access care by bringing it directly to their homes.Nearly 30% of patients in the U. For many, the ER becomes their first touchpoint with the...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Staff Data Engineer, Merchandising Catalog & Taxonomy (IC)

    Staff Data Engineer, Merchandising Catalog & Taxonomy (IC)

    Attachments King • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Attachments King is an eCommerce startup in the Heavy Equipment Industry developing proprietary software that flexibly discovers compatibility between equipment and host machine components.We’re hi...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Sr Staff Engineer - AI and Data Platform

    Sr Staff Engineer - AI and Data Platform

    CyberCoders • San Francisco, CA, US
    serp_jobs.job_card.full_time
    Sr Staff Engineer - AI and Data Platform.Sr Staff Engineer - AI and Data Platform.We are seeking a highly skilled Sr Staff Engineer to lead the development and enhancement of our AI and Data Platfo...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Staff Data Engineer, Merchandising Catalog & Taxonomy (IC) (San Francisco)

    Staff Data Engineer, Merchandising Catalog & Taxonomy (IC) (San Francisco)

    Attachments King • San Francisco, CA, US
    serp_jobs.job_card.part_time
    Attachments King is an eCommerce startup in the Heavy Equipment Industry developing proprietary software that flexibly discovers compatibility between equipment and host machine components.Were hir...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Staff Data Engineer - Substantiation Platform

    Staff Data Engineer - Substantiation Platform

    GEICO • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Staff Data Engineer - Substantiation Platform \ •REMOTE\ • page is loaded## Staff Data Engineer - Substantiation Platform \ •REMOTE\ •remote type : Remotelocations : San Francisco, CAtime type : Ful...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Staff Software Engineer - Data Platform

    Staff Software Engineer - Data Platform

    Hive • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Every day, we process data from millions of KM from 10s of thousands of high resolution sensors deployed around the world. A symphony of different Sensor Fusion, ML, AI, and 3D Sensing processes are...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Sr. Staff Engineer, AI and Data Platform

    Sr. Staff Engineer, AI and Data Platform

    Quizlet • San Francisco, CA, US
    serp_jobs.job_card.full_time
    At Quizlet, our mission is to help every learner achieve their outcomes in the most effective and delightful way.Our $1B+ learning platform serves tens of millions of students every month, in...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Staff Data Scientist

    Staff Data Scientist

    Lark Health • Mountain View, CA, US
    serp_jobs.job_card.full_time
    At Lark Health, we’re leading the way into a new era of cardiometabolic care, leveraging advanced AI techniques–including deterministic and generative models–to provide scalable, ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Staff Data Engineer - Internal Tools

    Staff Data Engineer - Internal Tools

    BitGo • Palo Alto, CA, US
    serp_jobs.job_card.full_time
    BitGo is the leading infrastructure provider of digital asset solutions, delivering custody, wallets, staking, trading, financing, and settlement services from regulated cold storage.Since our foun...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Staff Data Engineer, Energy

    Staff Data Engineer, Energy

    GoodLeap • San Francisco, CA, US
    serp_jobs.job_card.full_time
    GoodLeap is a technology company delivering best-in-class financing and software products for sustainable solutions, from solar panels and batteries to energy-efficient HVAC, heat pumps, roofing, w...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Staff Data Scientist

    Staff Data Scientist

    Windfall • San Francisco, CA, US
    serp_jobs.job_card.full_time
    At Windfall, data science is central to our mission as a people data and AI company.We aim to revolutionize how organizations perceive and utilize people data by providing leading commercial and no...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Data Engineer - Robotics & Foundation Models

    Data Engineer - Robotics & Foundation Models

    Approach Venture • Berkeley, CA, US
    serp_jobs.job_card.full_time
    Data Engineer – Help Build the Future of Intelligent Robotics Systems!.Join a fast-growing robotics startup on a mission to redefine how machines learn, adapt, and collaborate with humans.As ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted