Search jobs > San Jose, CA > Lead machine learning

Tech Lead Machine Learning Ops, Global SRE

TikTok
San Jose, CA
Full-time

Responsibilities

TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo.

Why Join Us

Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.

Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day.

To us, every challenge, no matter how difficult, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.

At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve.

Join us.

MLOps - Global SRE team is responsible for the stability of machine learning systems under the Global Monetization Products and Technology organization, to ensure the stable and efficient operations of machine learning models from data preparation, development, training, deployment, serving and so on.

Responsibilities

1) Responsible for setting SLOs of online machine learning serving systems, maintaining the stability of the online serving systems.

2) Responsible for maintaining stability of offline machine learning training tasks, improving the success rate of the training tasks.

3) Responsible for rolling out GPU model training in Non-Chin regions.

4) Responsible for stability of AIGC related machine learning tasks.

5) Responsible for resource management and planning of machine learning resources, including : cost and budget, resource efficiency enhancement, offline and online resources tides, etc.

Qualifications

Minimum requirements

1) Bachelor's degree in Computer Science or Software Engineering, similar technical field of study, or equivalent practical experience.

2) Expertise in Linux operating systems, networking, storage.

3) Experience programming in at least one of the following programming languages : Python, Go, C, C++, or Java.

4) Experience in troubleshooting application issues, or production operations.

5) Effective communication skills and a sense of ownership and drive.

Preferred qualifications :

1) Experience in SRE of machine learning systems.

2) Experience in SRE of ads / recommendation / search systems.

TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives.

Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy.

To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach.

We are passionate about this and hope you are too.

TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws.

If you need assistance or a reasonable accommodation, please reach out to us at redacted

30+ days ago
Related jobs
Promoted
TikTok
San Jose, California

We aim to drive and lead the generative AI in the ads tech and creative industry, powering products and driving values for our clients, creators, and the whole ecosystem. TikTok is the leading destination for short-form mobile video. TikTok has global offices including Los Angeles, New York, London,...

TikTok
San Jose, California

Responsible for the development of state-of-the-art applied machine learning projects. TikTok is the leading destination for short-form mobile video. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul, and Tokyo. We lead with curiosity ...

Promoted
TikTok
San Jose, California

We're looking for innovative Senior Machine Learning Engineers to join our TikTok Ads Core team. Our team is a platform team that develops state-of-art ad technologies, including ranking, retrieval, targeting, bidding, auction, etc. Build highly scalable machine learning systems/models to improve ad...

Annapurna Labs (U.S.) Inc.
Cupertino, California

Although we build and deploy machine learning chips, no machine learning background is needed for this role. Custom silicon chips live at the heart of AWS Machine Learning servers, and our team builds the backend software to run these servers. Enjoy learning new technologies, building software at sc...

Promoted
TikTok
San Jose, California

Master or above degree in computer science, statistics, or other relevant, machine-learning-heavy majors. Familiarity with data mining, data science, machine learning or finance/social platform risk defense. Bonus given if possessing at least one advantage among risk control, data science, machine l...

ByteDance
San Jose, California

The Frontend team in Global Payments is committed to building a highly integrated payment product that can be easily connected to the Web, App, cross-platform and other forms of Global Payment capabilities. We lead with curiosity and aim for the highest, never shying away from taking calculated risk...

TikTok
San Jose, California

TikTok is the leading destination for short-form mobile video. TikTok's global headquarters are in Los Angeles and Singapore, and its offices include New York, London, Dublin, Paris, Berlin, Dubai, Jakarta, Seoul, and Tokyo. SREs in our team keep the systems up and running with the highest level of ...

Apple
Sunnyvale, California

Apple’s Applied Machine Learning team has built systems for a number of large-scale data science applications. Join Apple's Applied Machine Learning Team, as a Senior Software Engineer, to enable GenAI across our Applications & Platforms. We use the latest in open source technology and as committers...

TikTok
San Jose, California

Responsibilities:- Provide technical leadership and build high performing teams within your organization, including mentoring junior team members and providing technical support for engineers on the team. Minimum Qualifications:- BS/MS degree in Computer Science or equivalent majors with experience ...

ByteDance
San Jose, California

We are looking for a Tech Lead Manager to build the large scale distributed system for our global e-commerce supply chain team. Global e-commerce supply chain team aims to improve customer experience with warehouse & supply chain technology, empower the merchants to ease and expand their business wo...