Talent.com
Staff ML Engineer - Infrastructure

ChipStack, San Jose, CA, US
Posted: 30 days ago
Job type: Full-time

Job Description

About Us

Chips are at the center of today's tech-driven world. But how we design them has not changed in decades, while their complexity and specialization have skyrocketed due to increasing performance demands from applications like AI. We want to change that.

Our team is small, technical, and fast-moving. We’ve built and shipped at the intersection of AI, EDA, and systems software, with deep roots at companies like Qualcomm, Nvidia, Google, Meta, and the Allen Institute for AI. We’re backed by top investors including Khosla Ventures, Cerberus, and Clear Ventures, and we are already deployed with 10+ innovative customers, from Fortune 100s to cutting-edge AI silicon startups.

About This Role

This role offers a unique opportunity to be part of the founding team at ChipStack, where we are reinventing how modern silicon chips are designed. You will work alongside highly experienced chip designers who have built complex chips, ML scientists who have trained LLMs at scale, and top-notch infrastructure and software engineers. You will get to leverage your experience building ML and data infrastructure and apply it to some of the hardest problems in chip design.

About You

You want to be at a startup because you love to be at the center of all the dynamism that a startup offers.

You are willing to put in the hours and go the extra mile to ensure every customer has an exceptional experience.

You are self-motivated with a sense of urgency and can operate independently without much guidance.

You are not afraid of difficult problems and enjoy venturing into areas you have not explored before.

This Role

We’re looking for a strong, experienced ML Infrastructure Engineer to join our founding team. We are seeking someone with experience designing and scaling ML infrastructure and training pipelines. You’ll be responsible for building the core infrastructure that enables training, fine-tuning, evaluation, and deployment of LLMs across cloud and on-premise environments. Your work will directly impact product capabilities and speed of iteration.

What's needed

5+ years of experience in ML infrastructure or adjacent roles

Deep expertise in Python and experience with training frameworks like PyTorch or TensorFlow

Strong systems engineering skills and experience with distributed training, data pipelines, and performance optimization

Experience deploying ML models to production (REST APIs, batch jobs, streaming pipelines)

Proficiency with cloud platforms (e.g., GCP, AWS) and containerized systems (Docker, Kubernetes)

Experience managing GPU / TPU workloads efficiently

Good communication skills and the ability to work directly with engineers and customers

Prior experience training or fine-tuning LLMs

Experience setting up observability, monitoring, and evaluation pipelines for ML models

What's good to have

Exposure to chip design fundamentals (via coursework or elsewhere)

Experience at an early-stage startup

Our Culture

Challenge the status quo: We are innovators who challenge the status quo and push forward our vision of the world.

Strong opinions, loosely held: We are low on ego but high on collaboration. We are okay with being wrong and are always open to learning.

Ship fast, ship quality: We ruthlessly prioritize what matters. We build a few things, but at lightning speed and with high quality.

Proud of our craft: Attention to detail is in our DNA. We take pride in what we build and ensure it exceeds the high standards of the semiconductor industry.
