Search jobs > San Jose, CA > Research engineer ai

Research Engineer, Generative AI Model Optimization

TikTok
San Jose, CA
Full-time

Responsibilities

TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo.

Why Join Us

Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.

Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day.

To us, every challenge, no matter how difficult, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.

At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve.

Join us.

Team Introduction

The AI Platform team is a team focusing on building advanced end-to-end production pipelines with core technologies including generative AI, intelligent editing and content understanding.

By utilizing AI model training, optimization, deployment and applications, we provide cutting-edge AI capabilities to empower content creation and consumption on TikTok and serve billions of users.

We are looking for strong research engineers to design and develop state-of-the-art generative AI model optimization systems and solutions.

The primary goal is to reduce the cost of generative AI applications while preserving quality and efficiency.

Responsibilities

  • Design and develop state-of-the-art optimization framework and solutions for generative AI model training and inference, utilize techniques including but not limited to parallelism, quantization, distillation and so on.
  • Research and develop cutting-edge distributed model training and inference optimization technologies, such as data parallelism, model parallelism etc.
  • Communicate and collaborate with cross-functional teams to clarify model performance requirements under different use cases, make reasonable trade-offs to provide the most cost-efficient model optimization solutions to TikTok product teams to support content creation and consumption.
  • Built platforms and tools that standardize common optimization methods to enlarge the usability of model optimization technologies among internal developers and researchers.

Qualifications

Minimum Qualifications

  • M.S. or Ph.D. in Computer Science or related fields with 3-5 years of experience in software development.
  • Proficient in one or more programming languages, such as Python, C, or C++. Familiar with common data structures and basic algorithms.
  • Deep knowledge in machine learning optimizations such as quantization, pruning, knowledge distillation, Neural Architecture Search (NAS), and optimizations for large models.
  • Familiar with one or more open-source deep learning frameworks, such as PyTorch, TensorFlow, PaddlePaddle etc. Understand the underlying design of the framework, familiar with common model architectures.

Preferred Qualifications

  • Experience in AutoML algorithms or model acceleration technologies.
  • Experience in contributing to open-source projects is preferred.
  • Ability to work collaboratively with global stakeholders

TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives.

Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy.

To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach.

We are passionate about this and hope you are too.

9 days ago
Related jobs
Promoted
Capital One
San Jose, California

We are looking for an experienced Senior Generative AI Engineer to help build and maintain APIs and SDKs to train, fine-tune and access AI models at scale. Senior Engineer - Generative AI Product Engineering. You will work as part of our Enterprise AI team and build systems that will enable our user...

Promoted
Capital One
San Jose, California
Remote

Lead Engineer, Generative AI Infrastructure to help us build the foundations of our AI capabilities. Senior Lead Engineer - Generative AI Infrastructure (Remote-Eligible). You will work on a wide range of initiatives, whether that’s building large-scale distributed training clusters, or deploying LL...

Promoted
Capital One
San Jose, California
Remote

Lead Engineer to help build and maintain APIs and SDKs to train, fine-tune and access AI models at scale. Senior Lead Engineer - Generative AI Product Engineering (Remote Eligible). You will work as part of our Enterprise AI team and build systems that will enable our users to work with Large-Langua...

Promoted
Capital One
San Jose, California
Remote

We are looking for an experienced Senior Generative AI Engineer to help build and maintain APIs and SDKs to train, fine-tune and access AI models at scale. Senior Engineer - Generative AI Product Engineering (Remote-Eligible). You will work as part of our Enterprise AI team and build systems that wi...

Promoted
Hireio, Inc.
San Jose, California

We are looking for strong tech lead software engineers to drive the design and implementation of our generative AI systems consisting of model training and optimization, deployment with efficient hardware consumption, and applications to user-facing products for image/video processing and interactiv...

ByteDance
San Jose, California

The team is a mix of experienced research scientists and engineers, aiming to advance the research boundaries in foundation models and apply our technologies to our rich application scenarios, whereas a feedback loop is created to help further improve our foundation technologies. We conduct cutting-...

Advanced Micro Devices, Inc
San Jose, California

AMD together we advance_ Staff Applied Machine Learning Software EngineerGenerative AI The Role The AI Group at AMD is searching for talented and motivated engineers and scientists to work on Generative AI inference solutions. Staff Applied Machine Learning Software EngineerGenerative AI The R...

ByteDance
San Jose, California

The team builds AI training and inference systems based on GPUs and advances the state-of-the-art of AI system technologies to accelerate large audio/music language models. The team is also responsible for the development of the complete engineering cycle of large models, including data preparing/pr...

AMD
San Jose, California

You will explore and improve upon state-of-the-art research in both academia and industry and innovate in the areas of software development, model optimization and compression algorithms for Generative AI applications such as LLMs, Stable Diffusion and Multi Modal Models. The AI Group at AMD is sear...

Apple
Cupertino, California

Experience with state-of-the-art NLP algorithms and AI models, Multi-modal LLMs, Multi-modal contrastive learning, Foundation models, Diffusion based models and parameter efficient fine tuning of LLMs. We are looking for a highly skilled and experienced AI Architect who has a robust understanding of...