Search jobs > Seattle, WA > Site reliability engineer

Site Reliability Engineer, Ads Data Platform- USDS

TikTok
Seattle
Full-time

TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. . Data Security ( USDS ) is a subsidiary of TikTok in the .

This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep .

users safe. Our focus is on providing oversight and protection of the TikTok platform and . user data, so millions of Americans can continue turning to TikTok to learn something new, earn a living, express themselves creatively, or be entertained.

The teams within USDS that deliver on this commitment daily span across Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions and more.

Why Join UsCreation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.

Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day. To us, every challenge, no matter how difficult, is an opportunity;

to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At TikTok, we create together and grow together.

That's how we drive impact - for ourselves, our company, and the communities we serve. Join us. Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures.

As a site reliability engineer in the Ads data platform area, you will have the opportunity to manage the services and infrastructures in one of the largest data plaforms in the world that directly supports the TikTok Ads ecosystem.

You'll need to ensure the data, services and infrastructures are reliable, fault-tolerant, efficiently scalable and cost-effective.

In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager / department.

We regularly review our hybrid work model, and the specific requirements may change at any time. Responsibilities :

  • Engage in and improve the whole lifecycle of service, from inception and design, through to deployment, operation and refinement
  • Ensure reliable, fault-tolerant, efficiently scalable and cost-effective data, services and infrastructures
  • Maintain production services by measuring and monitoring availability, latency and overall system health. Practice sustainable incident response and blameless postmortems.
  • Establish best engineering practice for engineers and non-technical stakeholders
  • Design and implement reliable, scalable, robust and extensible big data systems that support core products and business

Minimum Qualification : - Bachelor's degree in Computer Science, a related technical field involving software or systems engineering, or equivalent practical experience- Experience with algorithms and data structures- Solid communication and collaboration skills- Experience with Big Data technologies such as M / R, Hive, Spark, Metastore, Presto, Flume, Kafka, ClickHouse, Flink etc- Experience with Ads systems is a plus

30+ days ago
Related jobs
Promoted
TikTok
Seattle, Washington

Data Security ("USDS") is a subsidiary of TikTok in the U. The teams within USDS that deliver on this commitment daily span across Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions and more. Define service level indicators and data-driven ...

Promoted
Splunk
Seattle, Washington
Remote

About the RoleSplunk is looking for an enthusiastic and innovative Principal Software Engineer to join our Observability Data Platform organization. The Data Platform is a large-scale, highly performant, available and reliable system that processes billions of data points per minute. Experience with...

Promoted
TikTok
Seattle, Washington

We are a team of passionate Data Analysts, Data Scientists, and Operations who safeguard TikTok's US user data and join forces with cross-functional teams to derive actionable insights from US user data to maximize monetization results while still giving users a pleasant experience in our app. Data ...

Promoted
CYPRESS GROUP
Bellevue, Washington

You will work closely with a team of skilled engineers, product managers, and data scientists to build a robust system capable of handling high volumes and high transaction rates with utmost reliability and trust. Ensure data integrity and system reliability by implementing best practices in data se...

Promoted
Tik Tok
Seattle, Washington

As Tiktok revenue keeps growing, so are the unique engineering and UX challenges, as an engineering team, we also dedicate ourselves to solving challenging but interesting problems in a more scalable and innovative way through advanced software architecture, engineering practice and cutting-edge alg...

ByteDance
Seattle, Washington

Our data infrastructure Site Reliability Engineering (SRE) team is a pioneer in innovation. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fu...

JPMorgan Chase & Co.
Seattle, Washington

Advanced knowledge in site reliability culture and principles with demonstrated ability to implement site reliability within an application or platform. Demonstrates site reliability principles and practices every day and champions the adoption of site reliability throughout your team. Elevate your ...

Kalpita Technologies, Inc.
Redmond, Washington

Ttile : Site Reliability Engineer (SRE) i Location :Redmond, WA Onsite Duration : 6 months Type : W2 Must have Skills: C#,. ...

Apple
Seattle, Washington

Site Reliability Engineering, DevOps, or Infrastructure focused roleExperience supporting internet-facing production services and distributed systemsAbility to implement and coordinate telemetry using monitoring and observability tools such as Splunk, Grafana, and PrometheusCoding experience using a...

ByteDance
Seattle, Washington

Relying on the abundant data and computing resources of the platform, the team has continued to invest in relevant fields and has launched its own general large model, providing multi-modal capabilities. With a suite of more than a dozen products, including TikTok and Helo as well as platforms speci...