Search jobs > San Jose, CA > Internship > Software engineer backend

Backend Software Engineer Intern (TikTok Data Ecosystem, Data Lake) - 2025 Summer (BS/MS)

TikTok
San Jose
Full-time

Team Introduction

The TikTok Data Ecosystem Team has the vital role of crafting and implementing a storage solution for offline data in TikTok's recommendation system, which caters to more than a billion users.

Their primary objectives are to guarantee system reliability, uninterrupted service, and seamless performance. They aim to create a storage and computing infrastructure that can adapt to various data sources within the recommendation system, accommodating diverse storage needs.

Their ultimate goal is to deliver efficient, affordable data storage with easy-to-use data management tools for the recommendation, search, and advertising functions.

We are looking for talented individuals to join us for an internship in 2025. Internships at TikTok aim to offer students industry exposure and hands-on experience.

Turn your ambitions into reality as your inspiration brings infinite opportunities at TikTok. Internships at TikTok aim to provide students with hands-on experience in developing fundamental skills and exploring potential career paths.

A vibrant blend of social events and enriching development workshops will be available for you to explore. Here, you will utilize your knowledge in real-world scenarios while laying a strong foundation for personal and professional growth.

This Internship Program runs for 12 weeks beginning in May / June 2025. Successful candidates must be able to commit to one of the following summer internship start dates below : Monday, May 12Monday, May 19Tuesday May 27 (Memorial Day May 26)Monday, June 9Monday, June 23 We will prioritize candidates who are able to commit to these start dates.

Please state your availability clearly in your resume (Start date, End date). Candidates can apply to a maximum of two positions and will be considered for jobs in the order you apply.

The application limit is applicable to TikTok and its affiliates' jobs globally. Applications will be reviewed on a rolling basis - we encourage you to apply early.

Online AssessmentCandidates who pass resume evaluation will be invited to participate in TikTok's technical online assessment in HackerRank.

Responsibilities -Design and implement an offline / real-time data architecture for large-scale recommendation systems.-Design and implement a flexible, scalable, stable, and high-performance storage system and computation model.

  • Troubleshoot production systems, and design and implement necessary mechanisms and tools to ensure the overall stability of production systems.
  • Build industry-leading distributed systems such as offline and online storage, batch, and stream processing frameworks, providing reliable infrastructure for massive data and large-scale business systems.

Minimum Qualifications : - Pursuing a Bachelor's Degree or above, majoring in Computer Science, or related fields, with experience building scalable systems.

  • Proficiency in common big data processing systems like Spark / Flink at the source code level is required, with a preference for experience in customizing or extending these systems;
  • A deep understanding of the source code of at least one data lake technology, such as Hudi, Iceberg, or DeltaLake, is highly valuable and should be prominently showcased in your resume, especially if you have practical implementation or customization experience;
  • Knowledge of HDFS principles is expected, and familiarity with columnar storage formats like Parquet / ORC is an additional advantage;
  • Prior experience in data warehousing modeling;- Proficiency in programming languages such as Java, C++, and Scala is essential, along with strong coding skills and the ability to troubleshoot effectively;

Preferred Qualifications - Experience with other big data systems / frameworks like Hive, HBase, or Kudu is a plus;- A willingness to tackle challenging problems without clear solutions, a strong enthusiasm for learning new technologies, and prior experience in managing large-scale data (in the petabyte range) are all advantageous qualities.

30+ days ago
Related jobs
Promoted
Tik Tok
San Jose, California

Regarding code, developed LLM aims to automatically re-organize/optimize Tiktok codebase, and make the code/coding become more accessible for Tiktok engineers. In short, our team is a foundation model LLM group in Tiktok, aiming to explore large-scale&multi-modal LLMs and optimize systems to the lev...

Promoted
Intelliswift Software
CA, United States

Design and implement scalable data pipelines using Databricks and Spark. Integrate and manage data streams using Kafka. Collaborate with data scientists and analysts to understand data requirements and deliver solutions. Ensure data quality and integrity across various data sources. ...

Promoted
TikTok
San Jose, California

We are looking for software engineers who are excited to grow their business understanding, build highly scalable and reliable software/infrastructure, partner across functions with global teams, and make big impacts. The TikTok Ads Creative & Ecosystem team's mission is to solve the above dilem...

Promoted
VirtualVocations
Santa Clara, California

A company is looking for a Senior Data & Analytics Solutions Engineer, Direct Analytics. ...

TikTok
San Jose, California

In this role, you will:- Prototype new ideas and iterate towards the best developer experience;- Build, optimize, and scale the next generation of our automated build/test/deploy system;- Write high-quality, reusable code, and iterate towards the best developer experience;- Define and prioritize req...

Promoted
TikTok
San Jose, California

We are looking for passionate mobile software engineers to join us and to develop ads product on TikTok, including content ads, ads format, new surfaces monetization, vertical solutions, etc. This is doubly true of the teams that make TikTok possible. Working closely with product team, our mission i...

TikTok
San Jose, California

Ads/Monetization Technology teams are building the next-generation monetization platforms to help millions of customers grow their businesses, utilizing our products like TikTok. This is doubly true of the teams that make TikTok possible. Your profile will be reviewed by multiple teams within Moneti...

ByteDance
San Jose, California

With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content. We are looking for Software Engin...

TikTok
San Jose, California

Responsibilities We are looking for Research Interns to help us identify new opportunities and develop scientifically sound systems to improve technological guarantees for user privacy and advance the efficient frontier of privacy and utility in our systems, contribute to the development of librarie...

TikTok
San Jose, California

Responsibilities:-System Stability: Responsible for the stability of US business in TikTok content e-commerce, providing tools and governance ideas for the business, and participating in the construction of relevant stability systems. Have experience in developing large-scale distributing systems, f...