
Engineering Manager - Machine Learning Infrastructure

ByteDance
Seattle
Full-time

Responsibilities

Founded in 2012, ByteDance has a mission to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join Us

Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible.

Together, we inspire creativity and enrich life, a mission we work toward every day. To us, every challenge, no matter how ambiguous, is an opportunity to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together.

That's how we drive impact for ourselves, our company, and the users we serve. Join us.

The mission of our AML team is to advance the next-generation AI infrastructure and recommendation platform for ads ranking, search ranking, and live and e-commerce ranking across the company. We also drive substantial impact on the company's core businesses. We are currently looking for an Engineering Manager - Machine Learning Infrastructure to join our team to support and advance that mission.

Responsibilities:

  • Lead the team to design and implement distributed inference, training, scheduling, orchestration, storage, and parameter server infrastructure for feeds, ads, and search ranking models.
  • Oversee the development of monitoring and management tools to ensure the reliability and scalability of machine learning infra.
  • Manage the identification and prioritization of system inefficiencies and bottlenecks, leading efforts to enhance system performance.
  • Lead the team in creating tools to analyze bottlenecks and sources of instability, formulating and implementing effective solutions.
  • Collaborate with product teams, offering comprehensive solutions tailored to their specific requirements.

Job requirements:

  • Experience in leading an engineering team.
  • Experience in developing and deploying large-scale machine learning systems.
  • Strong sense of responsibility, with good communication and teamwork skills.
  • Passionate about solving complex and challenging problems.

Qualifications:

  • Experience contributing to an open-source machine learning framework (TensorFlow / JAX / PyTorch / TorchScript / MXNet / TensorRT).
  • Experience with big data frameworks (e.g., Spark / Hadoop / Flink) and with resource management and task scheduling for large-scale distributed systems.
  • Participation in parameter server system optimization or index structure optimization for search systems.
  • Strong background in one of the following fields: hardware-software co-design, high-performance computing, ML hardware acceleration (e.g., GPU / RDMA), or ML for systems.

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

ByteDance Inc. is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws.

If you need assistance or a reasonable accommodation,

30+ days ago