Machine Learning Research Intern, Multi-Modal Foundation Models (Robotics)

Karkidi
Los Altos, California, US
$45-$65 an hour
Internship

At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life. We’re developing new tools and capabilities to amplify the human experience.

To lead this transformative shift in mobility, we’ve built a world-class team in Robotics, Human-Centered AI, Human Interactive Driving, and Energy & Materials.

Below covers everything you need to know about what this opportunity entails, as well as what is expected from applicants.

This is a Summer 2024 paid 12-week internship opportunity. Please note that this internship will be a hybrid in-office role.

The Team

Our Machine Learning Language team is embedded with our Robotics team and is looking for Research Interns for Summer 2024 in a variety of areas such as large language (LLM) (training and fine-tuning), vision-language (VLMs), vision-language-action (VLAs) and other multi-modal foundation models.

We are interested in better approaches for alignment, model architectures, distillation and scalability, in addition to looking at approaches to handle new challenges in planning and compositionality where current LLMs fail.

We are aiming to make progress on some of the hardest scientific challenges around multi-modal foundation models for downstream applications in assistive robotics and across Toyota.

Multi-modal modeling is a core component of our robotics architecture as the team works towards Large Behavior Models.

The Internship

As a Research Intern, you will work with a multidisciplinary team proposing and conducting pioneering research in Machine Learning.

You will use large amounts of text, image, and other data to solve open problems and work towards publications at top academic venues!

Responsibilities

  • Conduct daring research, primarily at the intersection of Natural Language Processing and Computer Vision, that solves open problems of high practical and / or ethical value, and validate it in real-world benchmarks and systems.
  • Push the boundaries of knowledge and the state of the art in areas including language and multi-modal models.
  • Partner with a multidisciplinary team including other research scientists and engineers across the Robotics Machine Learning teams.
  • Stay up to date on the state-of-the-art in Machine Learning ideas and software.
  • Present results in verbal and written communications at international conferences, internally, and via open-source contributions to the community.

Qualifications

  • Currently pursuing a Ph.D. in Machine Learning, Natural Language Processing, Computer Vision, Robotics or related fields.
  • Publications or desire to publish at high-impact conferences / journals (e.g., NAACL, ICLR, NeurIPS, ICML, COLM, TMLR, EMNLP, *ACL etc.

on some of the aforementioned topics.

  • Proficiency with one or more coding languages and systems, preferably Python, Unix, and a Deep Learning framework (e.g., PyTorch).
  • Proficiency in engineering best practices for model and data scaling for large-scale model training.
  • Passionate about modern natural language and multi-modal processing, including training, understanding, and aligning large language models with human values.
  • Ability to work in collaboration with other researchers and engineers to invent and develop interesting research ideas.
  • Ability to execute on research projects, working in collaboration with other members of the team.
  • A reliable teammate who loves to think big, go deeper, and strives to deliver with integrity.

Please add a link to Google Scholar and include a full list of publications when submitting your CV to this position.

The pay range for this position at commencement of employment is expected to be between $45 and $65 / hour for California-based roles;

however, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience.

Note that TRI offers a generous benefits package including vacation and sick time. Details of participation in these benefit plans will be provided if an employee receives an offer of employment.

J-18808-Ljbffr

14 days ago
Related jobs
Promoted
Apple
Sunnyvale, California

We (Spatial Perception Team) looking for a machine learning researcher to work on the field of Generative AI and multi-modal foundation models. Join us in this truly exciting era of Artificial Intelligence to help deliver the next groundbreaking Apple product & experiences! As a member of our dynami...

Promoted
VirtualVocations
Santa Clara, California

A company is looking for a Machine Learning Research Engineer to lead the decoding of electrochemical signals from plants. ...

Promoted
Apple
Cupertino, California

We are looking for a Machine Learning Research Engineer to help deliver scalable, multilingual NLP solutions that empower our users to use intelligent text input in their language of choice. As a Machine Learning Research Engineer on our team, you will build and iteratively refine model pipelines th...

Promoted
Apple Inc.
Cupertino, California

The Data and Machine Learning Innovation team focuses on innovative technologies, methodologies, and research to enable amazing user experiences and advance the frontier of machine learning at Apple. This role is highly multi-functional, and you will collaborate very closely with various highly skil...

Promoted
Robotics Technologies LLC
Sunnyvale, California

Proficiency in machine learning algorithms such as multi-class classifications, decision trees, support vector machines and deep learning. Strong understanding of probability and statistical models (generative and descriptive models). Ability to collaborate effectively across multiple teams and stak...

TikTok
San Jose, California

Research experience in one or more of the following fields: applied machine learning, machine learning infrastructure, large-scale recommendation system, market-facing machine learning product;2. Powered by world-class machine learning technology, the TikTok Live Recommendation Team aims at providin...

ByteDance
San Jose, California

The Applied Machine Learning Enterprise team combines system engineering and machine learning to develop and operate massively distributed machine learning training, inference systems and services to serve both the big model vendors and users around the world. Preferred Qualifications:- Experience i...

TikTok
San Jose, California

Participate in the development and iteration of Ads algorithms by using Machine Learning, including ads query understanding, ads targeting, ads ranking, model serving reliability, etc. Good theoretical grounding in the machine and deep learning concepts and techniques (CNN/RNN/LSTM,. Familiar with t...

ByteDance
San Jose, California

Team IntroductionThe AML Machine Learning Systems team provides E2E machine learning experience and machine learning resources for the company. Responsibilities- Research and develop key components of machine learning systems, including distributed frameworks, cluster scheduling, storage systems, et...

ByteDance
San Jose, California

Strong research background in AI and machine learning with solid publications in leading conferences (, ICML, NeurIPS, ICLR) and journals, encompassing areas like large language models, diffusion models, geometric deep learning, natural language processing, computational protein design, protein stru...