Machine Learning Co-Design Researcher at Etched
About Etched: Etched is building AI chips that are hard-coded for individual model architectures. Our first product, Sohu, supports only transformers, but delivers an order of magnitude higher throughput and lower latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, such as real-time video generation models and extremely deep, parallel chain-of-thought reasoning agents.
Key Responsibilities:
- Translate core mathematical operations from transformer models into optimized operation sequences for Sohu
- Develop and leverage a deep understanding of Sohu to co-design both HW instructions and model architecture operations to maximize model performance
- Implement high-performance software components for the Model Toolkit
- Collaborate with hardware engineers to maximize chip utilization and minimize latency
- Implement efficient batching strategies and execution plans for inference workloads (a minimal batching sketch follows this list)
- Design and implement cutting-edge inference-time compute scaling methods
- Alter and fine-tune model architectures or inference-time compute algorithms
- Contribute to the evolution of our system architecture and programming model
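By way of illustration, one common batching strategy is continuous (iteration-level) batching, as used by serving stacks such as vLLM: new requests are admitted and finished requests retired at every decode step, rather than waiting for a whole static batch to drain. The sketch below is a minimal illustration in Python; the Request and model_step interfaces are hypothetical stand-ins, not Sohu's actual programming model.

from collections import deque
from dataclasses import dataclass, field

@dataclass
class Request:
    prompt: list[int]                  # prompt token ids
    max_new_tokens: int
    generated: list[int] = field(default_factory=list)

    def is_done(self, eos_id: int) -> bool:
        if not self.generated:
            return False
        return (len(self.generated) >= self.max_new_tokens
                or self.generated[-1] == eos_id)

def serve(model_step, waiting: deque, max_batch: int, eos_id: int):
    """Continuous batching: admit new requests and retire finished ones
    at every decode step, instead of draining a static batch."""
    running = []
    while waiting or running:
        # Admit as many waiting requests as fit in the batch.
        while waiting and len(running) < max_batch:
            running.append(waiting.popleft())
        # One decode step for the whole batch; model_step stands in for
        # the real forward pass and returns one new token per request.
        next_tokens = model_step(running)
        for req, tok in zip(running, next_tokens):
            req.generated.append(tok)
        # Finished requests leave immediately, freeing slots next step.
        running = [r for r in running if not r.is_done(eos_id)]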
Representative Projects:
- Optimize operation sequences to maximize Sohu's computational resources for specific transformer architectures such as Llama 4
- Research and implement efficient memory management for KV cache sharing and prefix optimization
- Develop algorithms for continuous batching and batch interleaving to improve throughput and/or latency
- Research and implement model-specific inference-time acceleration algorithms such as speculative decoding, tree search, KV cache sharing, and priority scheduling by interacting with the rest of the inference serving stack (see the sketch after this list)
- Research and implement structured decoding and novel sampling algorithms for reasoning models
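As a concrete illustration of one such acceleration technique, the sketch below shows greedy speculative decoding: a cheap draft model proposes k tokens, the target model scores all of them in a single batched forward pass, and the longest agreeing prefix is kept. This is a minimal sketch assuming hypothetical target_logits and draft_token interfaces; it uses simple greedy acceptance and omits the rejection-sampling correction required when sampling rather than decoding greedily.

import numpy as np

def speculative_decode(target_logits, draft_token, tokens, k=4, max_new=64):
    """Greedy speculative decoding sketch (hypothetical interfaces).

    draft_token(ctx) -> next token id from the cheap draft model.
    target_logits(ctx, draft) -> logits of shape (k + 1, vocab), where
    row i predicts the token following ctx + draft[:i].
    """
    tokens = list(tokens)
    produced = 0
    while produced < max_new:
        # 1. Draft model proposes k tokens autoregressively (cheap).
        draft, ctx = [], list(tokens)
        for _ in range(k):
            t = draft_token(ctx)
            draft.append(t)
            ctx.append(t)
        # 2. Target model scores every proposed position in one batched
        #    pass: one expensive call instead of k sequential ones.
        logits = target_logits(tokens, draft)
        # 3. Keep the longest prefix the target agrees with, then take
        #    one token from the target itself, so every iteration makes
        #    at least one token of progress.
        accepted = 0
        for i in range(k):
            if int(np.argmax(logits[i])) == draft[i]:
                accepted += 1
            else:
                break
        tokens += draft[:accepted]
        tokens.append(int(np.argmax(logits[accepted])))
        produced += accepted + 1
    return tokens

In a real serving stack, Sohu-specific scheduling, KV cache handling, and batching would live behind those interfaces.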
You may be a good fit if you have:
- Co-design expertise across both SW and HW domains
- Strong software engineering skills with systems programming experience
- Deep knowledge of transformer model architectures and/or inference serving stacks (vLLM, SGLang, etc.)
- Strong mathematical skills, especially in linear algebra
- Ability to reason about performance bottlenecks and optimization opportunities
- Experience working cross-functionally in diverse software and hardware organizations
Strong Candidates May Also Have:
- Experience with hardware accelerators, ASICs, or FPGAs
- Experience with the Rust programming language
- Deep expertise in ML systems engineering and hardware/software co-design with demonstrated impact (contributions to open-source projects or published papers)
- A track record of optimizing large co-designed SW/HW systems
Benefits:
- Full medical, dental, and vision packages, with generous premium coverage
- Housing subsidy of $2,000/month for those living within walking distance of the office
- Daily lunch and dinner in our office
- Relocation support for those moving to West San Jose
Compensation Range: $150,000 - $275,000
How We’re Different: Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs.
We are a fully in-person team in West San Jose and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed.