Site Reliability Engineer, Edge - USDS

TikTok
Mountain View
Full-time

TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security (USDS) is a subsidiary of TikTok in the U.S.

This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep U.S. users safe. Our focus is on providing oversight and protection of the TikTok platform and U.S. user data, so millions of Americans can continue turning to TikTok to learn something new, earn a living, express themselves creatively, or be entertained.

The teams within USDS that deliver on this commitment daily span Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions and more.

Why Join Us

Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible. Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day. To us, every challenge, no matter how difficult, is an opportunity: to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve. Join us.

Team Insight:

CDN Site Reliability Engineering combines software and network engineering with system operations to build and run large-scale, massively distributed infrastructure.

Our Edge SREs ensure infrastructure services are reliable, fault-tolerant, efficiently scalable and cost-effective. We dive deep into the stack, including network, OS, and applications, to quickly resolve complex functional and performance issues.

To enhance collaboration and cross-functional partnerships, among other things, our organization currently follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager / department. We regularly review our hybrid work model, and the specific requirements may change at any time.

Responsibilities:

  • Architect and implement solutions that enable both internal and external customers to harness the power of TikTok’s content delivery network.
  • Contribute to data pipelines, tools, automations, visualizations and monitors to facilitate the operation and optimization of edge services.
  • Data monitoring and alerting, data quality assurance and anomaly detection.
  • Document team processes and policies, including methods of engagement and SLOs.
  • Analyze, design and implement solutions at the system level to remove bottlenecks and improve edge service performance.
  • Implement monitoring and alerting to improve issue detection and response.
  • Work in a fast-paced environment. Participate in technical operations and rotations in response to performance and reliability issues.

Minimum Qualifications:

  • Bachelor's degree with 2+ years of experience in Computer Engineering, Computer Science, or related fields, or equivalent experience.
  • 2+ years working experience in the field of CDN performance and traffic engineering, network solution architecting or network-focused site reliability engineering roles.
  • Experience in networking technologies such as TCP/IP, BGP, DNS, etc. in a carrier-grade environment. Past experience with CDN technologies.
  • 2+ years of experience in one or more programming languages such as Java, C++, Go, or scripting experience in Shell and Python.
  • Strong analytical skills and the ability to solve real-world problems in a fast-moving environment.

Preferred Qualifications:

  • Experience operating in a multi-CDN environment.
  • Understanding of IPv6 and IPv4-IPv6 coexistence technologies.
  • Self-driven and capable of working with ambiguity and moving projects from concept to delivery.
  • Experience in designing, analyzing and building automation and tools for large scale systems.

Posted 2 days ago