Search jobs > New York, NY > Machine learning engineer

Machine Learning Engineer - Data Curation - AIGC, TikTok Monetization GenAI

TikTok
New York, NY
Full-time

Responsibilities

TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo.

Why Join Us

Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.

Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day.

To us, every challenge, no matter how difficult, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.

At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve.

Join us.

We are Generative AI team under Monetization Technology. Our team focuses on developing cutting-edge Generative AI techs across all modalities, including text, image, videos, landing pages, etc.

and creates industry-leading technical solutions to improve creative efficiency for advertisers, agencies and creators.

We are committed to automated creative workflows by leveraging Generative AI technologies, to increase overall revenue for advertisers, agencies and creators.

We aim to drive and lead the generative AI in the ads tech and creative industry, powering products and driving values for our clients, creators, and the whole ecosystem.

We are looking for infrastructure engineers who are excited to grow their business understanding, build highly scalable and reliable software / infrastructure, partner across functions with global teams, and make big impacts.

If you are someone who welcomes challenges, we are eager to have you on the team!

Responsibilities :

  • Collaborate with data scientists and researchers to create and maintain efficient, low-latency data pipelines for machine learning model training.
  • Design and implement a robust, scalable data curation and management system to facilitate foundational model training across text, image, and video formats in distributed environments.
  • Work alongside founder model developers to expedite the development / deployment of advanced large language models.
  • Stay informed about the latest trends and innovations in academia and the open-source community, adopting new technologies to improve data operations and ML model performance.

Qualifications

Minimum Qualifications :

1. B.S. / M.S. / Ph.D. in Computer Science, Computer Engineering, or a related field.

2. Programming and Technical Proficiency : Expertise in Python and a strong foundation in deep learning frameworks, such as PyTorch, as well as large model training libraries like FSDP / DeepSpeed and asyncio.

A minimum of 3 years' experience with Linux, Docker, and Kubernetes is required.

3. Data Engineering and AI / ML Knowledge : Demonstrated capability in data curation, management, and optimization within Generative AI ecosystems, encompassing both streaming and batch data processing.

This includes a thorough understanding of machine learning frameworks, parallel data processing techniques, and proficiency with large language models (e.

g., Llama series), text to image (e.g., Diffusion-Based Models, Diffusion Transformers), and text to video technologies (e.g., EMU series, MagViT).

Preferred Qualifications :

1. Advanced Technical Expertise : Experience in CUDA Optimization and a deep understanding of the application of Generative AI models across multiple domains.

2. Cloud Computing and Distributed Systems : Significant experience in managing large-scale data systems, with a strong preference for those who have worked with Vector Database solutions.

Proficiency in cloud services (AWS / GCP) and familiarity with machine learning training, deployment, and distributed computing frameworks like Spark.

3. Interpersonal and Problem-Solving Skills : A demonstrated passion for technology, coupled with outstanding problem-solving capabilities.

Exceptional communication, teamwork, and project management skills are essential, along with a resilient character.

TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives.

Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy.

To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach.

We are passionate about this and hope you are too.

TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws.

If you need assistance or a reasonable accommodation, please reach out to us at https : / / shorturl.at / cdpT2

30+ days ago
Related jobs
Promoted
TikTok
New York, New York

Engineer robust, high-performance data processing and large language model training/inference pipelines, drive engineering excellence and optimization initiatives to ensure the most effective use of resources, including cost optimization and performance tuning of the ML platform. Provide a cutting-e...

Promoted
Equation Staffing
New York, New York

Acquire data from primary or secondary data sources and maintain databases/data systems by working with the data procurement team. The need role is for a data-savvy individual to join the Data Science team as the first Data Analyst, on the team. Will use big data and machine learning to help clients...

Promoted
Oakwell Hampton
New York, New York

Are you ready to make an impact in the AI and machine learning space? My client is a fast-growing, mission-driven startup revolutionizing the way businesses process and analyze data. Develop and fine-tune machine learning models. Oversee the entire ML development lifecycle, including data gathering,...

Promoted
Stuut
New York, New York

We are looking for frontend, full stack, and machine learning engineers who are excited to be part of our founding story and help us build a diverse and vibrant tech community. About the Role Stuut is, at its core, an engineering company, and is on a mission to build the best engineering team. We hi...

Promoted
Etsy
Brooklyn, New York

Internet serving infrastructure, machine learning platforms, machine learning services and frameworks. We are looking for a Staff Software Engineer to join the GenAI Enablement squad to play a major role in driving the platform vision and strategy for the GenAI capabilities at Etsy. Have worked well...

Maania Consultancy Services
New York, New York

Required Qualifications: Must have 2 GCP certifications (Data Engineer, Machine Learning, & or Cloud). ...

S&P Global
New York, New York

We work alongside product teams across MI ES on break-through ideas using tools and techniques spanning the entire spectrum of Data Science, Statistics, Machine Learning, Deep Learning, Gen AI, Operations Research, Data and Machine Learning Engineering. The candidate will join a growing team of Data...

WarnerMedia Services, LLC
New York, New York

Staff Machine Learning Engineer on the CNN Machine Learning & Science Team, you will. The Machine Learning team at CNN Digital is dedicated to research, build and evaluate Machine Learning and AI capabilities at CNN. The team is comprised of talented ML Engineers, Data Scientists and ML Platform Eng...

Inuson International Inc. (i3
New York, New York
Remote

Through our firstproduct Grass we provide universal access to public web data and asystem where ordinary people can participate in the process andshare in the benefits. Bachelor s Master s or Doctoral degree in Data ScienceComputer Science Statistics or a relatedfield. A minimum of 3 years of workor...

Altice USA
Queens, New York

Machine Learning Engineers work to deploy end-to-end solutions to business problems leveraging AI and/or ML principles as needed to create those solutions. Degree in a quantitative discipline, such as Data Science, Applied Mathematics, Statistics, Economics, Operations Research, Computer Science, Ma...