Talent.com
Staff Data Engineer, Merchandising Catalog & Taxonomy (IC)

Staff Data Engineer, Merchandising Catalog & Taxonomy (IC)

Attachments KingSan Francisco, CA, United States
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

About The Role

Attachments King is an eCommerce startup in the Heavy Equipment Industry developing proprietary software that flexibly discovers compatibility between equipment and host machine components.

We’re hiring a Staff Data Engineer in an Individual Contributor role (no direct reports) to build and operate the single source of truth for a high‑SKU construction equipment catalog : taxonomy, product ingestion, price / availability pipelines from messy non‑API sources, and the automation layer that scales SKU count, price discovery, and data quality with for a small team with lean headcount.

This role is based in San Francisco, CA. This will be an in-office role and will extend past the standard 40 hours / week of many 9-5 jobs. We have long hours, weekend work sessions, and prioritize a results-driven culture.

Salary, Equity, and Benefits

Base Pay : $245,000 / year

Equity Offered : 2.00% (Options, 1yr Cliff, 4yr vest)

  • No Funding Raised, Most Recent 409A FMV is $10M.

Total Compensation : $295,000 / year

  • TC excludes potential refreshes; equity valued at 409A on grant date, amortized over 4 years
  • Employer-provided Health Insurance

    Employer-provided 401k Plan

    Day‑to‑day scope

  • Taxonomy & PIM modeling : Own category trees, attributes, variants, compatibility metadata, and normalization rules (GS1 / UNSPSC awareness; custom facets for consumer browse paths).
  • Data ingestion (messy source formats) : Build resilient pipelines for CSV / Excel, email attachments, SFTP, scraped HTML, PDFs, and images.
  • Transform & validate : Typed, idempotent ETL / ELT with schema evolution and contract-based QA.
  • Pricing & availability : Schedulers / agents to detect deltas, reconcile conflicts, discover competing listings, and publish to Shopify with guardrails for margin protection.
  • Images : Automation for background removal, resizing, deduping, and attribute extraction (e.g., dimensions, metadata).
  • Analytics : Build merchandising dashboards (assortment growth, price competitiveness, availability, metadata quality).
  • Operations & SRE : Observability, alerting, backfills, SLAs / SLOs, rollback strategies, and cost control.
  • Current Platforms

  • AWS (native-first) : S3, DynamoDB, Neptune, Lambda, Step Functions, ECS / Fargate, EventBridge, SQS / SNS, CloudWatch, SSM Parameter Store.
  • IaC : AWS CDK v2 (Python / TypeScript)
  • ECommerce Platform : Shopify Plus
  • Analytics : Power BI / Microsoft Fabric
  • AI Tooling : Cursor, Devin, Graphite, Personal ChatGPT Pro / Claude Max plans
  • Requests for, and use of, additional AI tools is heavily encouraged

    Core outcomes

    30 days :

  • Ship a production ingestion → normalization → enrichment → publish pipeline for all existing SKUs (2,200); stand up initial PIM data model with faceted attributes optimized for search / browse; wire price & availability watchers for all current vendors (files, web pages, emails, competitor websites).
  • Baseline data quality with automated contracts & tests ; initial operational dashboards (latency, freshness, fill rates, failure rates).
  • 90 days :

  • SKU count increased by 500% (11,000), coverage expanded to support top 100 product families and machine categories rank-ordered by search traffic demand; image set completeness >
  • 95% for top movers; pricing latency

  • AI / agent workflows auto‑extract attributes from PDFs / images; continuous taxonomy evolution with zero-downtime migrations.
  • 365 days :

  • Deliver $9.27M in annual revenue, 100% attributable to zero-touch online orders of managed SKUs.
  • Must‑have requirements

  • 7+ years building production data systems (or commensurate impact) : Python (pandas / polars), SQL (Postgres / Redshift / Snowflake / BigQuery), orchestration (Step Functions / Airflow / Prefect), eventing (SQS / Kafka), object storage (S3), CI / CD, containerization.
  • Ecommerce catalog expertise : PIM concepts (attribute schemas, variants / SKU creation, canonicalization, dedup), Shopify Admin / GraphQL, metafields, collections, feed health.
  • Non‑API data wrangling at scale : Selenium / Playwright for scraping (with robots / legal etiquette, rotation, backoff), email / SFTP ingestion, PDF OCR, document parsing.
  • Data quality & contracts : Great Expectations, Pydantic (typed models), versioned schemas, migration plans, data diffing, idempotency as a base case.
  • Image processing : PIL / Pillow, OpenCV, ImageMagick; batch pipelines and basic color / contrast / compositing.
  • Analytics : Power BI and / or Tableau; metric design for merchandising (coverage, freshness, price index, conversion lift).
  • AI / agentic workflows : Retrieval + tool‑use agents to extract attributes, reconcile conflicts, propose taxonomy changes; prompt chaining; evaluation harnesses; safe‑ops patterns for deterministic fallbacks.
  • Search relevance & indexing : Search relevance for catalogs (Meilisearch / Elastic / OpenSearch) and faceted navigation tuning.
  • AWS : S3, Lambda, Glue / Athena, Step Functions, ECS / Fargate, CloudWatch; IaC via the CDK; strong cost / performance instincts.
  • Nice‑to‑haves

  • Experience with homegrown PIMs
  • Vendor EDI familiarity; GS1 barcoding; UNSPSC mapping.
  • You Might Thrive Here If...

  • You are incredibly ambitious
  • You are a self-starter and intensely curious
  • You are hard-working and relentless, frequently going above and beyond in previous or current roles
  • You are driven by achievement and energized by big, industry-disrupting challenges
  • You want a "hardcore" work environment
  • You want to leave a positive impact on the world
  • About Attachments King

    Attachments King is E-Commerce for Heavy Machinery Attachments. We're pushing the boundaries of the construction industry with innovative proprietary technology that drastically improves the customer experience when purchasing heavy equipment. We firmly prioritize a hard-working, results-driven culture.

    Our bar for talent is high, and we do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status. If you are remarkably good at what you do, you belong on our team.

    For US Based Candidates : Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.

    This is the most important time to be alive in human history. Join us, and be a part of something incredible.

    serp_jobs.job_alerts.create_a_job

    Staff Data Engineer • San Francisco, CA, United States

    Job_description.internal_linking.related_jobs
    • serp_jobs.job_card.promoted
    Staff Data Engineer, Analytics

    Staff Data Engineer, Analytics

    Icon VenturesSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    About Quizlet : At Quizlet, our mission is to help every learner achieve their outcomes in the most effective and delightful way. Our $1B+ learning platform serves tens of millions of students every ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Staff Software Engineer, Data

    Staff Software Engineer, Data

    Sift Stack, Inc.San Francisco, CA, United States
    serp_jobs.job_card.full_time
    At Sift, we’re redefining how modern machines are built, tested, and operated.Our platform gives engineers real-time observability over high-frequency telemetry—eliminating bottlenecks and enabling...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Staff Software Engineer - Data

    Staff Software Engineer - Data

    Windfall Data, Inc.San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Windfall is seeking a Staff Software Engineer to join our Data team.As a Staff Engineer on our data team, you will be building out the core data asset that everything else at Windfall is built on t...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Staff / Senior AI Engineer

    Staff / Senior AI Engineer

    Airwallex Pty Ltd.San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Airwallex is the only unified payments and financial platform for global businesses.Powered by our unique combination of proprietary infrastructure and software, we empower over 150,000 businesses ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Senior Staff AI Engineer

    Senior Staff AI Engineer

    SonicJobsSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    At SonicJobs we're building the next generation of autonomous web agents based on computer-use technologies.Our agents operate on job application flows, like candidates do.If you've ever dreamed of...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
    • serp_jobs.job_card.promoted
    Staff Data Engineer, Merchandising Catalog & Taxonomy (IC)

    Staff Data Engineer, Merchandising Catalog & Taxonomy (IC)

    Attachments KingSan Francisco, CA, US
    serp_jobs.job_card.full_time
    Attachments King is an eCommerce startup in the Heavy Equipment Industry developing proprietary software that flexibly discovers compatibility between equipment and host machine components.We&rsquo...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Member of Technical Staff - Product Analytics & Experimentation Design

    Member of Technical Staff - Product Analytics & Experimentation Design

    xAIPalo Alto, CA, US
    serp_jobs.job_card.full_time
    AI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering exc...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Staff Machine Learning Engineer

    Staff Machine Learning Engineer

    HiveSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Hive is the leading provider of cloud-based AI solutions to understand, search, and generate content, and is trusted by hundreds of the world's largest and most innovative organizations.The company...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Staff Data Engineer - Internal Tools

    Staff Data Engineer - Internal Tools

    BitGoSan Francisco, CA, US
    serp_jobs.job_card.full_time
    BitGo is the leading infrastructure provider of digital asset solutions, delivering custody, wallets, staking, trading, financing, and settlement services from regulated cold storage.Since our foun...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Staff Software Engineer, Data Platform

    Staff Software Engineer, Data Platform

    Menlo VenturesSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    In this role as a Staff Software Engineer, you'll be the architect of our Data Platform's future, crafting a system that not only meets our current data needs but scales seamlessly with our ambitio...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Data Engineer Data Science and Analytics San Francisco, CA

    Data Engineer Data Science and Analytics San Francisco, CA

    Scale AI, Inc.San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Software is eating the world, but AI is eating software.We live in unprecedented times – AI has the potential to exponentially augment human intelligence. Every person will have a personal tutor, co...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Staff Data Engineer, Merchandising Catalog & Taxonomy (IC) (San Francisco)

    Staff Data Engineer, Merchandising Catalog & Taxonomy (IC) (San Francisco)

    Attachments KingSan Francisco, CA, US
    serp_jobs.job_card.part_time
    Attachments King is an eCommerce startup in the Heavy Equipment Industry developing proprietary software that flexibly discovers compatibility between equipment and host machine components.Were hir...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Staff Backend Engineer (Data / API team)

    Staff Backend Engineer (Data / API team)

    HockeyStack, Inc.San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Applied AI company on a mission to automate sales, marketing, and customer success for B2B companies.We build the most complete and accurate picture of the B2B buyer. We use this data to power appli...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Staff AI / ML Engineer

    Staff AI / ML Engineer

    ArtisanSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    At Artisan, we're creating AI Employees, called Artisans, and software which is sleek, easy to use, and replaces the endless stack of point solutions. We're starting with outbound sales and our AI B...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Staff Data Engineer

    Staff Data Engineer

    TwilioSan Francisco, California, United States
    serp_jobs.job_card.full_time
    At Twilio, we're shaping the future of communications, all from the comfort of our homes.We deliver innovative solutions tohundreds of thousands of businessesand empower millions of developers worl...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    • serp_jobs.job_card.new
    Staff Data Scientist - Sales Analytics

    Staff Data Scientist - Sales Analytics

    HarnhamSan Mateo, CA, United States
    serp_jobs.job_card.full_time
    Staff Data Scientist – Sales Analytics.This fast-growing Series E AI SaaS company is redefining how modern engineering teams build and deploy applications. We’re looking for a Staff Data Scientist t...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_hour
    • serp_jobs.job_card.promoted
    Staff Data Scientist / Machine Learning Engineer - Retailer Growth Operations

    Staff Data Scientist / Machine Learning Engineer - Retailer Growth Operations

    StartopsSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Staff Data Scientist / Machine Learning Engineer - Retailer Growth.Build ML models to personalize retailer onboarding and engagement strategies. San Francisco, California, United States.Staff Data S...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Staff Machine Learning Engineer

    Staff Machine Learning Engineer

    Intuit Inc.Mountain View, CA, United States
    serp_jobs.job_card.full_time
    Come join Intuit as a Staff Machine Learning Engineer!.In this role, you’ll work alongside AI scientists and machine learning engineers to create AI-powered experiences. You’ll be expected to help c...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Staff Engineer, Applied AI

    Staff Engineer, Applied AI

    ZapierSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Zapier is hiring a Staff Applied AI Engineer for our Agents team to make agentic automation useful at scale.You’ll shape how our Agents plan, reason, remember, and safely take action through 8,000+...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Staff Machine Learning Engineer

    Staff Machine Learning Engineer

    QuantcastSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    At Quantcast, we're redefining what's possible in digital advertising.As a global Demand Side Platform (DSP) powered by AI, we help marketers connect with the right audiences and deliver measurable...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day