Machine Learning Research Intern, Multi-Modal Foundation Models (Robotics)

Karkidi
Los Altos, California, US
$45-$65 an hour
Internship
We are sorry. The job offer you are looking for is no longer available.

At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life. We’re developing new tools and capabilities to amplify the human experience.

To lead this transformative shift in mobility, we’ve built a world-class team in Robotics, Human-Centered AI, Human Interactive Driving, and Energy & Materials.

Below covers everything you need to know about what this opportunity entails, as well as what is expected from applicants.

This is a Summer 2024 paid 12-week internship opportunity. Please note that this internship will be a hybrid in-office role.

The Team

Our Machine Learning Language team is embedded with our Robotics team and is looking for Research Interns for Summer 2024 in a variety of areas such as large language (LLM) (training and fine-tuning), vision-language (VLMs), vision-language-action (VLAs) and other multi-modal foundation models.

We are interested in better approaches for alignment, model architectures, distillation and scalability, in addition to looking at approaches to handle new challenges in planning and compositionality where current LLMs fail.

We are aiming to make progress on some of the hardest scientific challenges around multi-modal foundation models for downstream applications in assistive robotics and across Toyota.

Multi-modal modeling is a core component of our robotics architecture as the team works towards Large Behavior Models.

The Internship

As a Research Intern, you will work with a multidisciplinary team proposing and conducting pioneering research in Machine Learning.

You will use large amounts of text, image, and other data to solve open problems and work towards publications at top academic venues!

Responsibilities

  • Conduct daring research, primarily at the intersection of Natural Language Processing and Computer Vision, that solves open problems of high practical and / or ethical value, and validate it in real-world benchmarks and systems.
  • Push the boundaries of knowledge and the state of the art in areas including language and multi-modal models.
  • Partner with a multidisciplinary team including other research scientists and engineers across the Robotics Machine Learning teams.
  • Stay up to date on the state-of-the-art in Machine Learning ideas and software.
  • Present results in verbal and written communications at international conferences, internally, and via open-source contributions to the community.

Qualifications

  • Currently pursuing a Ph.D. in Machine Learning, Natural Language Processing, Computer Vision, Robotics or related fields.
  • Publications or desire to publish at high-impact conferences / journals (e.g., NAACL, ICLR, NeurIPS, ICML, COLM, TMLR, EMNLP, *ACL etc.

on some of the aforementioned topics.

  • Proficiency with one or more coding languages and systems, preferably Python, Unix, and a Deep Learning framework (e.g., PyTorch).
  • Proficiency in engineering best practices for model and data scaling for large-scale model training.
  • Passionate about modern natural language and multi-modal processing, including training, understanding, and aligning large language models with human values.
  • Ability to work in collaboration with other researchers and engineers to invent and develop interesting research ideas.
  • Ability to execute on research projects, working in collaboration with other members of the team.
  • A reliable teammate who loves to think big, go deeper, and strives to deliver with integrity.

Please add a link to Google Scholar and include a full list of publications when submitting your CV to this position.

The pay range for this position at commencement of employment is expected to be between $45 and $65 / hour for California-based roles;

however, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience.

Note that TRI offers a generous benefits package including vacation and sick time. Details of participation in these benefit plans will be provided if an employee receives an offer of employment.

J-18808-Ljbffr

11 days ago
Related jobs
Promoted
Robotics Prcocess Automation, LLC
San Jose, California

Specialized Area: Machine learning. Job Title: Machine learning Engineer. Work closely with engineers to build, test, deploy, and troubleshoot machine learning/algorithm-based software. Good understanding of foundational statistics concepts and algorithms: linear/logistic regression, random forest, ...

Promoted
Apple Inc.
Cupertino, California

We’re an applied Machine Learning team that leverages state of the art technologies like generative AI, graph machine learning, and private learning to deliver high quality inferences. Published research in the field of Machine Learning or AI is highly desirable, indicating an ability to not only un...

Promoted
Robotics Prcocess Automation, LLC
Santa Clara, California

Research, design, develop infrastructure for building Deep Learning models including core software supporting cutting edge models, tools for profiling and monitoring training process, hyper-parameter tuning, testing infrastructure. If you are a strong Software Engineer with an aptitude toward Machin...

Promoted
Robotics Technologies LLC
Santa Clara, California

Proven experience in developing deep learning models using TensorFlow, PyTorch, Keras, and similar tools. Strong experience and understanding of modifying and applying supervised/un-supervised machine learning algorithms on visual data. Strong understanding of machine learning; proficient at data an...

ByteDance
San Jose, California

The team focuses on cutting-edge R&D in areas like speech & audio, music processing, natural language understanding and multimodal deep learning. Responsibilities·Conduct cutting-edge machine learning research and development in music understanding and generation. We aim to lead global research and ...

Netflix
Los Gatos, California

As such, our research spans many Machine Learning areas, including recommender systems, causal inference, reinforcement learning, computer vision, computer graphics, image and video processing, natural language processing, optimization, operations research and systems. In Fall 2024, Netflix will be ...

ByteDance
San Jose, California

Team IntroductionThe AML Machine Learning System team combines system engineering and machine learning to develop and operate massively distributed machine learning training, inference systems and services around the world. Responsibilities- Participate in the research and development of machine lea...

Nuro
Mountain View, California

Research experiences we are looking for are generative models and diffusion models for autonomous driving and robotics. You have subject matter expertise and research in one or more of the following areas: scalable generative and diffusion models, generative model alignment with reward/cost models, ...

ByteDance
San Jose, California

Responsibilities:- Develop large language models tailored to specific domains;- Build innovative features using LLM, prompt engineering, or other machine learning technologies;Qualifications- Currently enrolled in a Master or PhD program in computer science or a related technical field;- Proficiency...

Nuro
Mountain View, California

You have subject matter expertise and research in one or more of the following areas: Machine Learning, Deep Learning, Statistics, Probability theory, Robustness theory, Robotics, or Autonomous Driving (preferable). Research new machine learning problems and models. To name a few, using self-supervi...