Machine Learning Research Engineer- Pretraining

Acceler8 Talent

CA, United States

Full-time

ML Research Engineer- Pretraining

Palo Alto, CA

What We're Building

We are entering a new growth phase focused on partnering with commercial entities to adapt and fine-tune our advanced models to meet their specific business needs.

Our achievements in developing, aligning, and deploying cutting-edge models for our high-EQ consumer-facing chatbot have laid a solid foundation for continued success.

With substantial resources and a strong infrastructure, we are well-equipped to support top-tier model finetuning. Joining our team offers an opportunity to bring your expertise to a dynamic, innovation-driven environment that values collaboration.

About Us

We are a small, interdisciplinary AI studio focused on training and fine-tuning state-of-the-art language models for specific commercial applications.

Our mission is to leverage AI to drive significant, positive change. As a public benefit corporation, we prioritize the well-being and happiness of our partners, users, and broader stakeholders.

About the Role

Our pretraining team is responsible for creating and refining the foundational models that enable our AI capabilities for enterprise solutions.

Research engineers in this role will focus on developing large-scale training datasets, optimizing training processes, and innovating model architectures to push the limits of what our models can achieve in enterprise settings.

This role is a good fit if you :

Have experience training large-scale language models from scratch or on extensive datasets.
Are skilled in managing and efficiently utilizing large compute resources for training.
Have a strong background in modern deep learning techniques and architectures, particularly with transformer models, and are proficient in PyTorch.
Enjoy experimenting with new training methodologies and hyperparameter tuning to achieve state-of-the-art results.
Are familiar with distributed training frameworks and tools like Horovod or DeepSpeed.

Our Work Culture

We prioritize excellence and ownership, with an organizational structure that emphasizes individual responsibility over management hierarchies.

We believe in the power of highly talented individual contributors, providing them with the resources and autonomy to deliver outstanding results.

Teamwork, generosity, and a culture of constructive disagreement are at our core, fostering an environment where positive challenges and new ideas are encouraged.

We also value strong communication, particularly in writing, and maintain a close feedback loop between user experience and AI development.

Engineering Approach

As a vertically integrated AI studio, we build and optimize our entire technology stack in-house, from large foundational model pretraining to the user interface.

We are committed to scale as a driver of progress in AI, developing and deploying new AI generations on one of the largest supercomputers in the world.

Our approach blurs the lines between engineering and research, with a continuous focus on innovation guided by user feedback.

Benefits

We offer generous benefits to ensure a positive, inclusive, and inspiring work environment, including :

Unlimited paid time off
Parental leave and flexibility for all parents and caregivers
Comprehensive medical, dental, and vision plans for US employees
Compliance with country-specific benefits for non-US employees
Visa sponsorship for new hires
Opportunities for personal growth, such as coaching, conference attendance, or specific training

Diversity & Inclusion

We are committed to building personal AIs that serve everyone, and we strive to represent the full spectrum of human experience within our AI studio.

We welcome individuals from all walks of life who possess the right skills and actively cultivate diverse candidate pools for all open roles.

Keywords : Research Engineer, Pretraining, AI Studio, Language Models, Fine-Tuning, Transformer Models, Deep Learning, PyTorch, Distributed Training, Horovod, DeepSpeed, Large-Scale Training, Compute Resources, Innovation, User Feedback, Vertical Integration, AI Development, Model Architecture, Engineering, Artificial intelligence, Machine learning, ML, Deep Learning,

30+ days ago

Related jobs

Machine Learning Engineer Graduate (Location Based Service) - 2025 Start (PhD)

San Jose, California

Experienced in one or more of the following topics: Tensorflow, Caffe, MxNet, PyTorch or other machine learning frameworks. ...

Staff Machine Learning Engineer

San Francisco, California

Expertise in one or more areas of machine learning, such as deep learning, reinforcement learning, probabilistic modeling, or optimization. That's why we're seeking a Staff Machine Learning Engineer eager to join EvenUp's mission. Provide technical leadership and mentorship for a highly skilled team...

Tech Lead Machine Learning Engineer, Feed Quality

San Jose, California

Experience in one or more of the following areas: applied machine learning, machine learning infrastructure, large-scale recommendation system, market-facing machine learning product;. The team is made up of machine learning researchers and engineers, who support and innovate on production recommend...

Machine Learning Engineer

Darwin Recruitment

San Francisco, California

Our mission is to make clear, effective communication accessible to everyone, and we’re looking for a talented Machine Learning Engineer with a passion for NLP to help us achieve this goal. As a Machine Learning Engineer with a focus on NLP, you will play a key role in developing and optimizing our ...

Machine Learning Engineer, E-Commerce Risk Control - USDS

Mountain View, California

A PhD in CS, Machine Learning, Statistics, Operations Research, or relevant field. Build machine learning solutions to respond to and mitigate business risks in Tiktok products/platforms. Up-level risk machine learning excellence on privacy/compliance, interoperability, risk perception and analysis....

Machine Learning Engineer

San Jose, California

Strong understanding of recent advancements inmachinelearning research. PhD in Electrical Engineering, Computer Science/Engineering or a related field. Excellent knowledge of theory and practice of machine learning. Knowledge of common machine learning frameworks. ...

Machine Learning Engineer - Research and Development

Avispa Technology

South San Francisco, California

A leading biotechnology company is seeking an exceptional Machine Learning Engineer with a passion for building machine learning algorithms and systems that will transform the drug discovery process. Machine Learning Engineer - Research and Development ROCGJP00027758. Solve core research engineering...

Machine Learning Engineer Graduate (E-Commerce Supply Chain & Logistics - CV/Multimodal) - 2025 Start (PhD)

San Jose, California

Join our vibrant and fast-paced team at TikTok as a Machine Learning Engineer, specializing in Computer Vision (CV), Artificial Intelligence Generated Content (AIGC), or Multimodal machine learning. We are seeking brilliant and motivated graduate software engineers, who are eager to apply their know...

Senior Machine Learning Engineer

Harrison Clarke

CA, United States

Founding Lead Machine Learning Engineer. This is a founding role where you’ll have the chance to shape the future of the company's AI products and drive key decisions on all things machine learning. Machine Learning or Knowledge Extraction. Develop methods to construct and evaluate AI Data Graphs, i...

Machine Learning Engineer

Palo Alto, California

Experience with machine learning techniques such as deep learning, reinforcement learning, and transfer learning. We are building an impactful full stack team of talented engineers to design, train and integrate machine learning capabilities that can be deployed across multiple platforms. The ideal ...