Research Engineer, Audio Dialogue Job at DeepMind in Mountain View

MediabistroMountain View, CA, United States

job_description.job_card.variable_days_ago

serp_jobs.job_preview.job_type

serp_jobs.job_card.full_time

job_description.job_card.job_description

Snapshot

Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.

Who are we? Real-Time Dialog team in GDM Audio

Our mission Mission : We are building the next generation of conversational capabilities, powered by the Gemini LLM. Our mission is to empower multimodal conversational agents with groundbreaking speech and audio capabilities. By starting with LLMs that natively understand rich audio input, we aim to create agents that can orchestrate all aspects of a dialog. This includes knowing when to listen and wait, when to interrupt, reading the emotive style, and coordinating complex multimodal interactions that span audio, video, and text. As part of the gemini audio team, our mission is to create, scale, and productionize novel capabilities into the core Gemini Model, impacting many product areas across Google.

Team Objectives :

Create real-time dialog capabilities for Gemini agents that seamlessly span audio, video, and text modalities.

Pioneer end-to-end ML / Gemini architectures that streamline the dialog process, minimizing the need for complex model cascades.

Direct impact on Gemini core model development – setting the direction for real-time dialog capabilities covering pre-training and post-training.

Collaborate on deployments in Gemini Live (GL), Cloud, XR (Glasses), Astra, and other product areas pushing the boundaries of multimodal interaction research.

Scale core modeling advancements to meet product requirements / model families, including latency, safety, and factuality.

Job responsibilities As a Senior Research Engineer, you will :

Lead efforts to produce more natural and capable real-time dialog agents

Collaborate closely with other teams on areas like reasoning, function calling, and multi-agent frameworks

Partner with product teams design, develop, and deploy novel multimodal conversational agents.

Create new data pipelines covering both real and synthetic sources, influencing pre-training and post-training (SFT & RL)

About You

In order to set you up for success as a Research Engineer at Google DeepMind, we look for the following skills and experience :

Bachelor degree in Computer Science, a related field, or equivalent practical experience.

Significant industry experience building and deploying Speech / ML models.

Demonstrated experience in data preparation, training, and evaluation of ML models.

5 years of experience with software development in Python, esp. ML frameworks like Tensorflow, JAX, PyTorch, etc.

In addition, the following would be an advantage :

Ph.D. in Computer Science or a related field.

Experience with multimodal foundation models.

Experience with real-time multimodal dialog systems.

Hands-on experience with the Gemini models (data processing, training, SFT, RL, serving).

Research background in NLP / Generative AI

Experience with C

The US base salary range for this full-time position is between $197,000 – $291,000 bonus equity benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.

At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.

#J-18808-Ljbffr

serp_jobs.job_alerts.create_a_job

Audio Engineer • Mountain View, CA, United States

Job_description.internal_linking.related_jobs

serp_jobs.job_card.promoted

Senior Research Scientist, Audio Machine Learning

Google Inc.Mountain View, CA, United States

serp_jobs.job_card.full_time

Senior Research Scientist, Audio Machine Learning.PhD degree in Computer Science, a related field, or equivalent practical experience. One of more scientific publication submission(s) for conference...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days

serp_jobs.job_card.promoted
serp_jobs.job_card.new

Machine Learning Engineer, Audio Perception (Mountain View)

WaymoMountain View, CA, United States

serp_jobs.job_card.full_time

Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours

Research Engineer – Audio & Speech Models Job at Zyphra in Palo Alto

MediabistroPalo Alto, CA, United States

serp_jobs.job_card.full_time

Research Engineer – Audio & Speech Models.Research Engineer – Audio & Speech Models.Continue with Google Continue with Google. Research Engineer – Audio & Speech Models.Be among the first 25 applica...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30

Zyphra Technologies Inc. is hiring : Research Engineer – Audio & Speech Models in

MediabistroPalo Alto, CA, United States

serp_jobs.job_card.full_time

Research Engineer - Audio & Speech Models.Zyphra’s Audio Team, building the next generation of open-source text-to-speech and audio models. You will be deeply involved in the entire model training p...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30

Zyphra is hiring : Research Engineer – Audio & Speech Models in Palo Alto

MediabistroPalo Alto, CA, United States

serp_jobs.job_card.full_time

The Rundown AI, Inc. is hiring : Machine Learning Engineer, Audio Perception in M

MediabistroMountain View, CA, United States

serp_jobs.job_card.full_time

Overview Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days

Google DeepMind is hiring : Research Engineer, Audio Dialogue in Mountain View

MediabistroMountain View, CA, United States

serp_jobs.job_card.full_time

Mountain View, California, US; New York City, New York, US Snapshot Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engine...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days

Research Engineer Audio & Speech Models Job at Zyphra in Palo Alto

MediabistroPalo Alto, CA, United States

serp_jobs.job_card.full_time

Zyphra is an artificial intelligence company based in Palo Alto, California.Research Engineer - Audio & Speech Models.Zyphra's Audio Team, building the next generation of open-source text-to-speech...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30

DeepMind is hiring : Research Engineer, Audio Dialogue in Mountain View

MediabistroMountain View, CA, United States

serp_jobs.job_card.full_time

Audio Deep Learning Engineer Job at femtoAI in San Bruno

MediabistroSan Bruno, CA, United States

serp_jobs.job_card.full_time

Join an ambitious team revolutionizing embedded AI at femtoAI! We’re delivering state-of-the-art deep learning solutions to run on our company’s custom hardware. The Sparse Processing Unit (SPU) chi...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days

Research Engineer Audio & Speech Models

MediabistroPalo Alto, CA, United States

serp_jobs.job_card.full_time

Senior Research Scientist, Audio Machine Learning Job at Google in Mountain View

MediabistroMountain View, CA, United States

serp_jobs.job_card.full_time

Senior Research Scientist, Audio Machine Learning.Be among the first 25 applicants.Get AI-powered advice on this job and more exclusive features. As a Research Scientist, you'll setup large-scale te...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days

Google Inc. is hiring : Senior Research Scientist, Audio Machine Learning in Moun

MediabistroMountain View, CA, United States

serp_jobs.job_card.full_time

Research Engineer, Audio Dialogue Job at Google DeepMind in Mountain View

MediabistroMountain View, CA, United States

serp_jobs.job_card.full_time

Mountain View, California, US; New York City, New York, US.Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machi...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days

serp_jobs.job_card.promoted

Senior Research Engineer - Video & Audio Generative AI / ML

CanvaSan Francisco, CA, United States

serp_jobs.job_card.full_time

Senior Research Engineer - Video & Audio Generative AI / ML.Join to apply for the Senior Research Engineer - Video & Audio Generative AI / ML role at Canva. Partnering with Research Scientists on re...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30

serp_jobs.job_card.promoted

Machine Learning Engineer, Audio Perception

WaymoSan Francisco, CA, United States

serp_jobs.job_card.full_time

Senior Research Engineer - Video & Audio Generative AI / ML Job at Canva in San

MediabistroSan Francisco, CA, United States

serp_jobs.job_card.full_time

Machine Learning Engineer, Audio Perception

MediabistroMountain View, CA, United States

serp_jobs.job_card.full_time