Search jobs > San Jose, CA > Lead machine learning

Tech Lead Machine Learning Ops, Global SRE

TikTok
San Jose, CA
Full-time

Responsibilities

TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo.

Why Join Us

Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.

Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day.

To us, every challenge, no matter how difficult, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.

At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve.

Join us.

MLOps - Global SRE team is responsible for the stability of machine learning systems under the Global Monetization Products and Technology organization, to ensure the stable and efficient operations of machine learning models from data preparation, development, training, deployment, serving and so on.

Responsibilities

1) Responsible for setting SLOs of online machine learning serving systems, maintaining the stability of the online serving systems.

2) Responsible for maintaining stability of offline machine learning training tasks, improving the success rate of the training tasks.

3) Responsible for rolling out GPU model training in Non-Chin regions.

4) Responsible for stability of AIGC related machine learning tasks.

5) Responsible for resource management and planning of machine learning resources, including : cost and budget, resource efficiency enhancement, offline and online resources tides, etc.

Qualifications

Minimum requirements

1) Bachelor's degree in Computer Science or Software Engineering, similar technical field of study, or equivalent practical experience.

2) Expertise in Linux operating systems, networking, storage.

3) Experience programming in at least one of the following programming languages : Python, Go, C, C++, or Java.

4) Experience in troubleshooting application issues, or production operations.

5) Effective communication skills and a sense of ownership and drive.

Preferred qualifications :

1) Experience in SRE of machine learning systems.

2) Experience in SRE of ads / recommendation / search systems.

TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives.

Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy.

To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach.

We are passionate about this and hope you are too.

TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws.

If you need assistance or a reasonable accommodation, please reach out to us at redacted

30+ days ago
Related jobs
Promoted
TikTok
San Jose, California

Optimize user experiences during creation via a deep understanding of user behavior and state-of-the-art machine learning technologies. Strong theoretical knowledge and practical experience in machine learning. Familiarity with one or more areas in machine learning, computer vision, natural language...

Promoted
Acceler8 Talent
CA, United States

Join us as a Lead ML Engineer on the mission to be the leading computing platform for Artificial General Intelligence (AGI). Keywords: LLMs, Large Language Models, Neural Networks, Machine Learning, ML Research. As an Lead ML Engineer, you will contribute to the forefront of AGI research by shaping ...

Promoted
TikTok
San Jose, California

We're looking for innovative Senior Machine Learning Engineers to join our TikTok Ads Core team. Our team is a platform team that develops state-of-art ad technologies, including ranking, retrieval, targeting, bidding, auction, etc. Build highly scalable machine learning systems/models to improve ad...

Promoted
Harnham
CA, United States

Lead Machine Learning Engineer. As a Lead Machine Learning engineer you will…. As a Lead Machine Learning Engineer, you can expect a base salary between $200,000 to $240,000 (based on experience) plus competitive benefits. TEAM: Machine learning and data science is the core part of the company. ...

Promoted
TikTok
San Jose, California

We are a group of applied machine learning engineers and research scientists that focus on E-commerce video/live-streaming recommendations on the major traffic source of Tiktok ForU page, where we serve traffic for billions of users every single day. We are interested and excited about applying larg...

TikTok
San Jose, California

Master or above degree in computer science, statistics, or other relevant, machine-learning-heavy majors. Familiarity with data mining, data science, machine learning or finance/social platform risk defense. Bonus given if possessing at least one advantage among risk control, data science, machine l...

ByteDance
San Jose, California

Proficiency in modern machine learning theories and applications, including ensemble trees, deep neural networks, transfer/multi-task learning, reinforcement learning, graph theory, and unsupervised learning. RESPONSIBILITIES - Develop and implement innovative machine learning algorithms to manage b...

TikTok
San Jose, California

Ability to effectively communicate technical concepts to non-technical audiences. Collaborate with PM and R&D teams globally, in a fast-paced environment. Minimum Qualifications:- Bachelor or higher degree in Computer Science or related technical discipline- 5 years of experience developing highly s...

Hireio, Inc.
San Jose, California

Bachelors or higher degree in Computer Science or related technical discipline; - 2+ years of experience managing or tech-leading a software engineering team; - 5+ years of experience working in large-scale system development and solid track of records; - Strong communication and teamwork skills; - ...

ByteDance
San Jose, California

The Frontend team in Global Payments is committed to building a highly integrated payment product that can be easily connected to the Web, App, cross-platform and other forms of Global Payment capabilities. We lead with curiosity and aim for the highest, never shying away from taking calculated risk...