Machine Learning Engineer-Model Serving Infrastructure

ByteDance
Seattle, WA, US
$123.1K-$184.5K a year
Full-time
We are sorry. The job offer you are looking for is no longer available.

Responsibilities

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

The mission of our AML team is to push next-generation machine learning algorithms and platform for the recommendation system, ads ranking and search ranking in our company.

We also drive substantial impact on core businesses of the company. Currently, we are looking for Software Engineer - Machine Learning Serving Infrastructure to join our team to support and advance that mission.

Responsibilities :

  • Responsible for the design and implementation of distributed inference infrastructure for feeds, ads and search ranking models.
  • Responsible for building monitoring / managing tools to oversee the reliability and scalability of online inference servers.
  • Responsible for triaging system inefficiency and bottlenecks and improving system performance.
  • Responsible for building tools to analyze bottlenecks and sources of instability and then design and implement solutions.
  • Responsible for collaboration with product teams and providing general solutions to meet their requirements.

Qualifications

Proficient in C / C++ / CUDA, and have solid programming skills.

  • Familiar with deep learning serving frameworks (TensorFlow Serving / TorchScript).
  • Experience in GPU performance optimization.
  • Ability to work independently and complete projects from beginning to end and in a timely manner.
  • Good communication and teamwork skills to clearly communicate technical concepts with other teammates.

Preferred Qualifications :

  • Experience contributing to an open sourced machine learning framework (TensorFlow / PyTorch Script).
  • Experience in developing and deploying large-scale systems.
  • Strong background in one of the following fields : Hardware-Software Co-Design, High Performance Computing, ML Hardware Acceleration (e.

g., GPU / RDMA) or ML for Systems.

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives.

Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life.

To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach.

We are passionate about this and hope you are too.

ByteDance Inc. is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws.

If you need assistance or a reasonable accommodation, please reach out to us at [email protected]

Job Information :

For Pay Transparency?Compensation Description (annually)

The base salary range for this position in the selected city is $123120 - $184500 annually.

Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience, and location.

Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses / incentives, and restricted stock units.

Our Company Benefits Are Designed To Convey Company Culture And Values, To Create An Efficient And Inspiring Work Environment, And To Support Our Employees To Give Their Best In Both Work And Life.

We Offer The Following Benefits To Eligible Employees :

We cover 100% premium coverage for employee medical insurance, approximately 75% premium coverage for dependents and offer a Health Savings Account(HSA) with a company match.

As well as Dental, Vision, Short / Long term Disability, Basic Life, Voluntary Life and AD&D insurance plans. In addition to Flexible Spending Account(FSA) Options like Health Care, Limited Purpose and Dependent Care.

Our time off and leave plans are : 10 paid holidays per year plus 17 days of Paid Personal Time Off (PPTO) (prorated upon hire and increased by tenure) and 10 paid sick days per year as well as 12 weeks of paid Parental leave and 8 weeks of paid Supplemental Disability.

We also provide generous benefits like mental and emotional health benefits through our EAP and Lyra. A 401K company match, gym and cellphone service reimbursements.

The Company reserves the right to modify or change these benefits programs at any time, with or without notice.

J-18808-Ljbffr

2 days ago
Related jobs
Promoted
Pinterest
Seattle, Washington

With more than 500 million users around the world and 300 billion ideas saved, Pinterest Machine Learning engineers build personalized experiences to help Pinners create a life they love. Build cutting edge technology using the latest advances in deep learning and machine learning to personalize Pin...

Promoted
VirtualVocations
Seattle, Washington

A company is looking for a Machine Learning Engineer (AI-Platform) to enhance their AI model development processes for cancer therapy personalization. ...

Promoted
Apple
Seattle, Washington

The Knowledge Quality team is looking for extraordinary Machine Learning engineers to join a team of world-experts on Large-Scale Data Management and Machine Learning Systems. The Apple Knowledge Quality Team is building the next-generation of machine learning solutions for Knowledge Q&A at Apple an...

Promoted
ESR Healthcare
Seattle, Washington

You will be responsible for building a scalable Machine Learning platform that will be used to train, evaluate, deploy, serve and monitor ML models and to manage data. Proficiency in operating machine learning solutions at scale, covering the end-to-end ML workflow. Expertise with tools and platform...

Promoted
TikTok
Seattle, Washington

Build highly scalable machine learning systems and state-of-the-art machine learning models to improve ads ranking quality and optimize advertisers' marketing strategies. We are seeking Machine Learning Engineers who can help us to improve our existing delivery system that optimizes for advertisers'...

Mediabistro
Seattle, Washington

Principal Machine Learning Engineer. Design, implement, and scale critical machine learning models to support Snap's monetization strategies. Knowledge, Skills & Abilities:Strong understanding of machine learning and deep learning approaches and algorithms, and their applications to advertising, rec...

Promoted
TikTok
Seattle, Washington

As a machine learning engineer on the Ads Signal team, you will develop novel machine learning solutions, build scalable tech foundations and launch various products to maximize signal values for ads in a privacy-preserving way. The team is working on various products, including web/app signals mode...

ByteDance
Seattle, Washington

Covering various sub-directions of machine learning system, including resource scheduling, model training, model inference, data management, and workflow orchestration. Responsibility: Responsible for the machine learning system development of the company's large-scale models, researching new applic...

Apple
Seattle, Washington

Experience in Search, Machine Learning, NLP, Large Language Models and applying these techniques at scaleStrong software engineering skills in a mainstream programming language, such as Python, Go, C/C++Familiarity with NLP/ML tools and packages like Jax, TensorFlow, pyTorch etcPractical experience ...

Stripe
Seattle, Washington

We are exploring new areas and kicking off projects where you can have an outsized impact on the architecture, implementation, and design choices behind these machine learning models and systems. Set and execute a vision for incorporating new advances in machine learning and deep learning in ways th...