Talent.com
Research Engineer Audio & Speech Models

Research Engineer Audio & Speech Models

MediabistroPalo Alto, CA, United States
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Zyphra

Zyphra is an artificial intelligence company based in Palo Alto, California.

The Role :

As a Research Engineer - Audio & Speech Models , you will be a core contributor on Zyphra's Audio Team, building the next generation of open-source text-to-speech and audio models. You will be deeply involved in the entire model training process from data gathering and processing to designing novel architectures and training methodologies.

You'll work across :

  • Large-scale audio training runs
  • Performance optimization of our training stack
  • Audio dataset collection, processing, and evaluation
  • Architecture and training methodology ablations and improvements

Requirements :

  • Strong research taste and intuition. The ability to work through a research project from conception to execution to write-up.
  • Strong implementation and prototyping ability (can take an idea from conception to experimentation quickly)
  • The ability to work well with others in a high-paced research setting
  • Can rapidly learn new fields and are excited to implement new ideas
  • Excellent communication and collaboration skills, and can work effectively on both research and engineering implementation at scale.
  • Bonus Qualifications :

  • Expertise and intuition for training models in the audio domain, including text-to-speech, ASR, speech-to-speech, speech-emotion-recognition, or other models
  • Experience in training audio autoencoders.
  • Understanding of signal processing, especially of audio signals.
  • Experience with diffusion models, consistency models, or GANs
  • Experience with training on large-scale (multi-node) GPU clusters
  • Strong grasp of proper experimental methodology for running rigorous ablations and other hypothesis testing
  • Understanding of and interest in large-scale, highly parallel data processing pipelines.
  • Proficiency with PyTorch and Python.
  • Experience contributing to large pre-existing codebases and rapidly getting up to speed.
  • Previously published machine learning research in well-respected venues.
  • Postgraduate degree in a scientific subject (Computer Science, EE / EECS, Mathematics, Physics, Machine Learning)
  • Why Work at Zyphra :

  • Our research methodology is to make grounded, methodical steps toward ambitious goals. Both deep research and engineering excellence are equally valued
  • We strongly value new and crazy ideas and are very willing to bet big on new ideas
  • We move as quickly as we can; we aim to minimize the bar to impact as low as possible
  • We all enjoy what we do and love discussing AI
  • Benefits and Perks :

  • Comprehensive medical, dental, vision, and FSA plans
  • Competitive compensation and 401(k)
  • Relocation and immigration support on a case-by-case basis
  • On-site meals prepared by a dedicated culinary team; Thursday Happy Hours
  • In-person team in Palo Alto, CA, with a collaborative, high-energy environment
  • serp_jobs.job_alerts.create_a_job

    Audio Engineer • Palo Alto, CA, United States

    Job_description.internal_linking.related_jobs
    Staff Audio DSP Engineer, Infotainment Platform Job at Rivian and Volkswagen Gro

    Staff Audio DSP Engineer, Infotainment Platform Job at Rivian and Volkswagen Gro

    MediabistroPalo Alto, CA, United States
    serp_jobs.job_card.full_time +1
    Staff Audio DSP Engineer, Infotainment Platform.Staff Audio DSP Engineer, Infotainment Platform.Rivian and Volkswagen Group Technologies. Staff Audio DSP Engineer, Infotainment Platform.Be among the...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    Applied Audio ML Engineer Job at David AI in San Francisco

    Applied Audio ML Engineer Job at David AI in San Francisco

    MediabistroSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    This range is provided by David AI.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. About Our Machine Learning Team.Our Machine Learning team sit...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.new
    Audio Mechanical Design Engineer

    Audio Mechanical Design Engineer

    MediabistroSunnyvale, CA, United States
    serp_jobs.job_card.full_time
    This range is provided by Henderson Scott.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Senior Recruitment Consultant - Connecting Cyber Secur...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
    Waymo is hiring : Machine Learning Engineer, Audio Perception in San Francisco

    Waymo is hiring : Machine Learning Engineer, Audio Perception in San Francisco

    MediabistroSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Senior Research Engineer - Multimodal & Video Foundation Model (Remote)

    Senior Research Engineer - Multimodal & Video Foundation Model (Remote)

    Tether Operations LimitedSan Francisco, CA, United States
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    Join Tether and Shape the Future of Digital Finance.At Tether, we’re pioneering a global financial revolution with solutions that empower businesses—from exchanges and wallets to payment processors...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    Student Researcher (Doubao (Seed) - Foundation Model - Speech & Audio) - 2025 St

    Student Researcher (Doubao (Seed) - Foundation Model - Speech & Audio) - 2025 St

    MediabistroSan Jose, CA, United States
    serp_jobs.job_card.full_time
    Student Researcher (Doubao (Seed) - Foundation Model - Speech & Audio) - 2025 Start (PhD) at ByteDance.Join to apply for the Student Researcher (Doubao (Seed) - Foundation Model - Speech & Audio) -...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    Senior Machine Learning Engineer, Audio Perception Job at Waymo in San Francisco

    Senior Machine Learning Engineer, Audio Perception Job at Waymo in San Francisco

    MediabistroSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Senior Machine Learning Engineer, Audio Perception.Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. The Waymo Driver powers Waymo’s fully aut...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    Machine Learning Engineer, Audio Perception Job at Waymo in San Francisco

    Machine Learning Engineer, Audio Perception Job at Waymo in San Francisco

    MediabistroSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.new
    Mercor is hiring : Audio Engineer - Sound Design Expert in San Francisco

    Mercor is hiring : Audio Engineer - Sound Design Expert in San Francisco

    MediabistroSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Audio Engineer - Sound Design Expert.This range is provided by Mercor.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.Headquartered in San Franc...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
    Founding Applied ML Engineer Audio & Speech Job at Gulf Coast Automation Group L

    Founding Applied ML Engineer Audio & Speech Job at Gulf Coast Automation Group L

    MediabistroSan Francisco, CA, United States
    serp_jobs.job_card.full_time +1
    Founding Applied ML Engineer Audio & Speech.Location : San Francisco (strongly preferred) or New York City.Position Type : Full-Time (Permanent). A fast-scaling early-stage startup is building the wor...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    Audio Deep Learning Engineer Job at femtoAI in San Bruno

    Audio Deep Learning Engineer Job at femtoAI in San Bruno

    MediabistroSan Bruno, CA, United States
    serp_jobs.job_card.full_time
    Join an ambitious team revolutionizing embedded AI at femtoAI! We’re delivering state-of-the-art deep learning solutions to run on our company’s custom hardware. The Sparse Processing Unit (SPU) chi...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    Canva is hiring : Senior Research Engineer - Video & Audio Generative AI / ML in

    Canva is hiring : Senior Research Engineer - Video & Audio Generative AI / ML in

    MediabistroSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Overview Senior Research Engineer - Video & Audio Generative AI / ML Join to apply for the Senior Research Engineer - Video & Audio Generative AI / ML role at Canva Responsibilities Partnering w...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    Audio Software Engineer (Generalist), Infotainment Platform

    Audio Software Engineer (Generalist), Infotainment Platform

    MediabistroPalo Alto, CA, United States
    serp_jobs.job_card.full_time
    Audio Software Engineer (Generalist), Infotainment Platform.Audio Software Engineer (Generalist), Infotainment Platform.Rivian and Volkswagen Group Technologies. Audio Software Engineer (Generalist)...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.new
    Audio Engineer | Upto $20 / hr Remote Job at Mercor in San Francisco

    Audio Engineer | Upto $20 / hr Remote Job at Mercor in San Francisco

    MediabistroSan Francisco, CA, United States
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    Audio Engineer | Upto $20 / hr Remote.This range is provided by Mercor.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Headquartered in San Franci...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
    David AI is hiring : Applied Audio ML Engineer in San Francisco

    David AI is hiring : Applied Audio ML Engineer in San Francisco

    MediabistroSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    David AI is the first audio data research company.We bring an R&D approach to data-developing datasets with the same rigor AI labs bring to models. Our mission is to bring AI into the real world, an...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    Research Scientist Multi-channel Audio Processing and Machine Learning Job at Oc

    Research Scientist Multi-channel Audio Processing and Machine Learning Job at Oc

    MediabistroBurlingame, CA, United States
    serp_jobs.job_card.full_time
    Research Scientist Multi-channel Audio Processing and Machine Learning.We are developing technologies to enable breakthrough Smartglasses, AR glasses, and VR headsets. The audio team within RL Resea...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Senior Research Engineer - Video & Audio Generative AI / ML

    Senior Research Engineer - Video & Audio Generative AI / ML

    CanvaSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Senior Research Engineer - Video & Audio Generative AI / ML.Join to apply for the Senior Research Engineer - Video & Audio Generative AI / ML role at Canva. Partnering with Research Scientists on re...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    Senior Research Engineer - Video & Audio Generative AI / ML

    Senior Research Engineer - Video & Audio Generative AI / ML

    MediabistroSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Overview Senior Research Engineer - Video & Audio Generative AI / ML Join to apply for the Senior Research Engineer - Video & Audio Generative AI / ML role at Canva Responsibilities Partnering with...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Machine Learning Engineer, Siri Speech

    Machine Learning Engineer, Siri Speech

    Apple Inc.San Francisco, CA, United States
    serp_jobs.job_card.full_time
    San Francisco Bay Area, California, United States Machine Learning and AI.Are you excited about Generative AI and Large Language Models? Are you interested in working on cutting-edge generative mod...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    Tesla Motors, Inc. is hiring : Audio Hardware Engineer, Infotainment in Palo Alto

    Tesla Motors, Inc. is hiring : Audio Hardware Engineer, Infotainment in Palo Alto

    MediabistroPalo Alto, CA, United States
    serp_jobs.job_card.full_time
    Tesla is seeking an Audio Engineer within the Infotainment Team who has had 5+ years of design, test and release responsibility of Acoustics, Microphones and Signal Processing products.This Enginee...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days