Site Reliability Engineer, Data Platform - USDS

TikTok
Mountain View, CA
Full-time

Responsibilities

TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo.

Why Join Us

Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.

Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day.

To us, every challenge, no matter how difficult, is an opportunity: to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.

At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve.

Join us.

About USDS

At TikTok, we're committed to a process of continuous innovation and improvement in our user experience and safety controls.

We're proud to be able to serve a global community of more than a billion people who use TikTok to creatively express themselves and be entertained, and we're dedicated to giving them a platform that builds opportunity and fosters connection.

We also take our responsibility to safeguard our community seriously, both in how we address potentially harmful content and how we protect against unauthorized access to user data.

U.S. Data Security ("USDS") is a standalone department of TikTok in the U.S. This new security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep U.S. users safe. Our focus is on providing oversight and protection of the TikTok platform and user data in the U.S., so millions of Americans can continue turning to TikTok to learn something new, earn a living, express themselves creatively, or be entertained.

The teams within USDS that deliver on this commitment daily span Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions and more.

Team Introduction

TikTok's Data Platform Team focuses on challenges in the areas of data infrastructure and data products. The team is in charge of various aspects including Query Engine, Logging and Data Ingestion Infra, Experimentation Platform, as well as Workflow Management Platform.

The goal is to support ad hoc and interactive queries, batch pipelines, logging and ingestion of large volumes of real-time data, and A/B testing for all product feature launches.

To enhance collaboration and cross-functional partnerships, among other things, our organization currently follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager/department.

We regularly review our hybrid work model, and the specific requirements may change at any time.

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures.

As a site reliability engineer in the data platform area, you will have the opportunity to manage the services and infrastructures in one of the largest data platforms in the world.

You'll need to ensure the data, services and infrastructures are reliable, fault-tolerant, efficiently scalable and cost-effective.

You'll also have the opportunity to design, build and deliver all kinds of systems as a software engineer.

  • Engage in and improve the whole lifecycle of services, from inception and design through to deployment, operation and refinement
  • Ensure reliable, fault-tolerant, efficiently scalable and cost-effective data, services and infrastructures
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health. Practice sustainable incident response and blameless postmortems.
  • Establish best engineering practices for engineers as well as non-technical people
  • Design and implement reliable, scalable, robust and extensible big data systems that support core products and business

Qualifications

  • BS or MS degree in Computer Science or a related technical field, or equivalent practical experience
  • Experience with Big Data technologies (Hadoop, MapReduce, Hive, Spark, Metastore, Presto, Flume, Kafka, ClickHouse, Flink, etc.)
  • Experience with performing data analysis, data ingestion and data integration
  • Solid communication and collaboration skills

TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives.

Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy.

To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach.

We are passionate about this and hope you are too.

TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws.

If you need assistance or a reasonable accommodation, please reach out to us at redacted

30+ days ago