Senior Data Engineer - Community Discovery ML

Twitch
California, Missouri, US
Full-time

About Us

Twitch is the world’s biggest live streaming service, with global communities built around gaming, entertainment, music, sports, cooking, and more.

It is where thousands of communities come together for whatever, every day.

We’re about community, inside and out. You’ll find coworkers who are eager to team up, collaborate, and smash (or elegantly solve) problems together.

We’re on a quest to empower live communities.

About the Role

The Community Discovery ML team focuses on providing personalized, relevant experiences for Twitch users through Recommendation and Search.

We are looking for a senior data engineer to join us. You will be the first data engineer hired in a hybrid team of ML engineers and scientists, working on data challenges related to ML models and products.

You will extend, design, and build new capabilities in our data systems to ensure fast ML model development and productionization.

You will have cross-team impact by defining expectations for data usage patterns and data quality.

You will report to an Engineering Manager and work in San Francisco / Bay Area.

You Will:

  • Oversee team data architecture to meet ML use cases in production.
  • Design and build scalable data pipelines to support personalization models.
  • Develop and maintain low-latency, large-scale streaming and batch data processing systems.
  • Collaborate with applied scientists and ML engineers to integrate data into production models.
  • Optimize data workflows for performance and cost efficiency.
  • Implement best practices for data governance and security.
  • Troubleshoot and resolve data-related issues, focusing on identifying and solving data quality problems.
  • Mentor others in the team in data-related solutions and skills.

You Have:

  • 6+ years of experience as a data engineer or in a similar role.
  • Proficiency in SQL, Python, or Scala.
  • Experience building high-throughput, low-latency batch and streaming data pipelines.
  • Strong understanding of data architecture and data modeling principles.
  • Experience analyzing large datasets to identify gaps and inconsistencies, provide data insights, and promote effective product solutions.
  • Hands-on experience with cloud platforms (AWS, GCP, or Azure) and their data services.
  • Familiarity with ETL tools and data warehousing solutions.
  • Experience with distributed data processing technologies such as Apache Spark, Flink, and Kafka.
  • Experience working with cross-functional roles like ML engineers and scientists.

Bonus Points

  • Experience with AWS data ecosystems like Redshift, Kinesis, and Glue.
  • Understanding of data requirements for ML production systems.
  • Extensive experience with mature, large-scale production data systems, and the ability to define a strong North Star and make incremental progress toward it.

Perks

  • Medical, Dental, Vision & Disability Insurance
  • 401(k)
  • Maternity & Parental Leave
  • Flexible PTO
  • Amazon Employee Discount

We are an equal opportunity employer and value diversity at Twitch. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Twitch values your privacy. Please consult our Candidate Privacy Notice for information about how we collect, use, and disclose personal information of our candidates.

Job ID: TW8541
