Search jobs > New York, NY > Site reliability engineer

Early Career Site Reliability Engineer, Global E-commerce- USDS

TikTok
New York, NY
Full-time

Responsibilities

TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security ("USDS") is a subsidiary of TikTok in the U.

S. This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep U.

S. users safe. Our focus is on providing oversight and protection of the TikTok platform and U.S. user data, so millions of Americans can continue turning to TikTok to learn something new, earn a living, express themselves creatively, or be entertained.

The teams within USDS that deliver on this commitment daily span across Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions and more.

Why Join Us

Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.

Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day. To us, every challenge, no matter how difficult, is an opportunity;

to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At TikTok, we create together and grow together.

That's how we drive impact - for ourselves, our company, and the communities we serve. Join us.

About the Team

The Global E-commerce SRE team of US Tech Services works with engineering and product teams to build and run large-scale, globally distributed, observable, fault-tolerant systems.

As an SRE, you will deliver on production ownership and be responsible for observability and automation across complex, large-scale service mesh architectures.

In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager / department.

We regularly review our hybrid work model, and the specific requirements may change at any time.

What You'll Do :

Support the service level of a critical, revenue generating E-commerce platform as well as related infrastructure and services.

This role will focus on service reliability, highly-scalable design and release management in a cloud-native environment.

  • Implement SRE practices around incident management, post-mortems while being part of on-call rotations.
  • Define service level indicators and data-driven objectives to uphold and improve uptime, latency, and system health of a core TikTok production platform.
  • Collaborate cross team with engineering and product to ensure that key requirements (such as capacity planning and launch reviews) are performed to enable transparent service delivery to customers.
  • Automation geared towards efficiency, scalability and service resiliency

Qualifications

Minimum Qualifications :

Good understanding of Unix / Linux operating systems internals and networking

Experience writing code in Java, Go, Python or a similar language

Familiarity with large-scale system design components (Redis, Elasticsearch, Kafka, Druid, Hadoop, Flink or comparable solutions), relational databases, caching solutions and web service frameworks

Experience with algorithms, data structures, complexity analysis and software design

Experience developing tools and APIs to reduce manual interaction with systems and applications using a variety of coding and scripting standards

Systematic problem-solving approach, coupled with effective communication skills and a sense of drive

Preferred Qualifications :

Familiarity with running production grade web services at scale and understanding cloud native technologies and networking

Candidates for this position must be legally authorized to work in the United States. This position is not eligible for visa sponsorship or support.

TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives.

Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy.

To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach.

We are passionate about this and hope you are too.

TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws.

If you need assistance or a reasonable accommodation, please reach out to us at https : / / shorturl.at / ktJP6

This role requires the ability to work with and support systems designed to protect sensitive data and information. As such, this role will be subject to strict national security-related screening.

17 days ago
Related jobs
Promoted
Vimeo
New York, New York

Come work on the Site Reliability Engineering team at Vimeo! Your job will be to design, develop, deploy, maintain, and optimize the platform that powers an application that is part of the infrastructure of the Internet: Vimeo. SRE at Vimeo spans the domains of platform engineering, database adminis...

Promoted
TENTH MOUNTAIN LLC
New York, New York

Proven experience as a Senior DevOps Engineer or Site Reliability Engineer in an Agile environment. Join KGES as a Lead Site Reliability Engineer. As a Lead Site Reliability Engineer, you'll have the opportunity to leverage your expertise in DevOps, cloud infrastructure, and incident management,...

Lorven technologies
New York, New York

Hi,</b></p> <p> </p> <p>Our client is looking <b>Site Reliability Engineer (SRE) </b>for<b> Long Term Contract </b>project in<b> Charlotte NC, NJ or NY (Hybrid) </b>below is the detailed requirements. MsoNoSpacing"> </p...

Figma
Queens, New York

More than anything, we seek engineers with strong coding fundamentals and a track record of high quality engineering. We strive to foster a positive, inclusive culture - you can read more about our engineering values on our blog. Craft performance objectives with your manager that align to company p...

Celonis
New York, New York

You will be part of a highly technical, collaborative and creative team, with a focus on SRE & Software Engineering. Responsible for the design, implementation, reliability and management of cloud-based FedRAMP-compliant applications and platforms. Computer Science, Software Engineering) or a co...

NBCUniversal
New York, New York
Remote

NBCUniversal has an opening for a Site Reliability Engineer focused primarily on, but not limited to, supporting live channel origination on the Distribution Engineering team within the NBCU Operations and Technology division. Assist in the design, analysis, or evaluation of assigned projects using ...

DApp360 Workforce LLC
New York, New York
Remote

DApp360 Workforce is recruiting for an experienced and talented Remote Site Reliability Engineer with experience building and designing systems, monitors, tools, frameworks, and methodologies to ensure the reliability of trading platforms. Define “rules of the road” for DevOps engineers to...

iHeartMedia
New York, New York

The Senior Site Reliability Engineer will be responsible for leading a talented team of SREs/DevOps Engineers across a wide variety of Cloud Services. Run Reliability Incident management processes along with Root Cause Analysis, developing Runbooks . ...

Squarespace
New York, New York

The Infrastructure Engineering teams are looking for an experienced and passionate software engineer to help ensure that customers worldwide can access Squarespace products quickly and reliably. We work with the product teams to maintain the reliability of our system, using a fleet of microservices,...

Capital One
New York, New York

As a Capital One Lead Software Engineer, Site Reliability Engineer you’ll have the opportunity to be on the forefront of driving a major transformation within Capital One. Ave (22114), United States of America, New York, New YorkLead Software Engineer, Site Reliability (Bank Tech). New York City (Hy...