Machine Learning Research Engineer - Pretraining

Acceler8 Talent
CA, United States
Full-time

ML Research Engineer - Pretraining

Palo Alto, CA

What We're Building

We are entering a new growth phase focused on partnering with commercial entities to adapt and fine-tune our advanced models to meet their specific business needs.

Our achievements in developing, aligning, and deploying cutting-edge models for our high-EQ consumer-facing chatbot have laid a solid foundation for continued success.

With substantial resources and a strong infrastructure, we are well-equipped to support top-tier model fine-tuning. Joining our team offers an opportunity to bring your expertise to a dynamic, innovation-driven environment that values collaboration.

About Us

We are a small, interdisciplinary AI studio focused on training and fine-tuning state-of-the-art language models for specific commercial applications.

Our mission is to leverage AI to drive significant, positive change. As a public benefit corporation, we prioritize the well-being and happiness of our partners, users, and broader stakeholders.

About the Role

Our pretraining team is responsible for creating and refining the foundational models that enable our AI capabilities for enterprise solutions.

Research engineers in this role will focus on developing large-scale training datasets, optimizing training processes, and innovating model architectures to push the limits of what our models can achieve in enterprise settings.

This role is a good fit if you:

  • Have experience training large-scale language models from scratch or on extensive datasets.
  • Are skilled in managing and efficiently utilizing large compute resources for training.
  • Have a strong background in modern deep learning techniques and architectures, particularly with transformer models, and are proficient in PyTorch.
  • Enjoy experimenting with new training methodologies and hyperparameter tuning to achieve state-of-the-art results.
  • Are familiar with distributed training frameworks and tools like Horovod or DeepSpeed.

Our Work Culture

We prioritize excellence and ownership, with an organizational structure that emphasizes individual responsibility over management hierarchies.

We believe in the power of highly talented individual contributors, providing them with the resources and autonomy to deliver outstanding results.

Teamwork, generosity, and a culture of constructive disagreement are at our core, fostering an environment where positive challenges and new ideas are encouraged.

We also value strong communication, particularly in writing, and maintain a close feedback loop between user experience and AI development.

Engineering Approach

As a vertically integrated AI studio, we build and optimize our entire technology stack in-house, from large foundational model pretraining to the user interface.

We are committed to scale as a driver of progress in AI, developing and deploying new AI generations on one of the largest supercomputers in the world.

Our approach blurs the lines between engineering and research, with a continuous focus on innovation guided by user feedback.

Benefits

We offer generous benefits to ensure a positive, inclusive, and inspiring work environment, including:

  • Unlimited paid time off
  • Parental leave and flexibility for all parents and caregivers
  • Comprehensive medical, dental, and vision plans for US employees
  • Benefits that comply with country-specific requirements for non-US employees
  • Visa sponsorship for new hires
  • Opportunities for personal growth, such as coaching, conference attendance, or specific training

Diversity & Inclusion

We are committed to building personal AIs that serve everyone, and we strive to represent the full spectrum of human experience within our AI studio.

We welcome individuals from all walks of life who possess the right skills, and we actively cultivate diverse candidate pools for all open roles.

Keywords: Research Engineer, Pretraining, AI Studio, Language Models, Fine-Tuning, Transformer Models, Deep Learning, PyTorch, Distributed Training, Horovod, DeepSpeed, Large-Scale Training, Compute Resources, Innovation, User Feedback, Vertical Integration, AI Development, Model Architecture, Engineering, Artificial Intelligence, Machine Learning, ML

30+ days ago