Talent.com
Machine Learning, Platform Engineer
Machine Learning, Platform EngineerTogether AI • San Francisco, CA, United States
Machine Learning, Platform Engineer

Machine Learning, Platform Engineer

Together AI • San Francisco, CA, United States
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

This role focuses on enabling custom models and dedicated inference on Together. We are responsible for optimizing autoscaling, minimizing cold starts, achieving the best end-to-end model performance, and providing a best-in-class developer experience with great tooling.

Required Qualifications

  • 5+ years of demonstrated experience in building large scale, fault tolerant, distributed systems and API microservices
  • Experience running serverless inference platforms, doing model bring-up on short notice, being on call, or general cloud provider is a very big plus
  • Good taste and ability to thoughtfully discuss how what you’ve built has failed over time
  • Experience designing, analyzing and improving efficiency, scalability, and stability of various system resources
  • Excellent understanding of low level operating systems concepts including concurrency, networking and storage, performance and scale
  • Expert-level programmer in one or more of Golang, Rust, Python, C++, or Haskell
  • Proficiency in writing and maintaining Infrastructure as Code (IaC) using tools like Terraform
  • Experience with Kubernetes or other container orchestration systems
  • Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience
  • Writing-heavy roles or companies are a plus

Key Responsibilities

  • New hires may work on multi-cluster orchestration, portfolio optimization, predictive autoscaling, control panes, model bring-up, light model optimization, APIs for managing deployments, inference worker SDKs, and CLI tools.
  • Analyze and improve the robustness and scalability of existing distributed systems, APIs, databases, and infrastructure
  • Partner with product teams to understand functional requirements and deliver solutions that meet business needs
  • Write clear, well-tested, and maintainable software and IaC for both new and existing systems
  • Conduct design and code reviews, create developer documentation, and develop testing strategies for robustness and fault tolerance
  • About Together AI

    Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models.

    Compensation

    We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is : $160,000 - $250,000 + equity + benefits.

    Equal Opportunity

    Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

    #J-18808-Ljbffr

    serp_jobs.job_alerts.create_a_job

    Machine Learning Engineer • San Francisco, CA, United States

    Job_description.internal_linking.related_jobs
    Machine Learning Engineer, Recommendation

    Machine Learning Engineer, Recommendation

    NewsBreak • Mountain View, CA, US
    serp_jobs.job_card.full_time
    Founded in 2015, NewsBreak is the Content Intelligence platform shaping the future content economy.With over 40 million monthly active users, our flagship platform delivers highly personalized loca...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Machine Learning Systems Platform Engineer

    Machine Learning Systems Platform Engineer

    Blue Signal • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Confidential Opening : Machine Learning Systems Platform Engineer.San Francisco, CA (Hybrid Preferred).A stealth-mode innovator at the forefront of AI infrastructure is seeking a dynamic Machine Lea...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Principal Machine Learning Engineer

    Principal Machine Learning Engineer

    General Motors • Sunnyvale, CA, United States
    serp_jobs.job_card.full_time
    We are seeking a Principal AI Engineer to lead the design and advancement of our AI platform.You will play a key role in shaping the infrastructure that powers large-scale training and cloud infere...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Principal Machine Learning Engineer

    Principal Machine Learning Engineer

    Tubi Tv • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Boldly built for every fandom, Tubi is a free streaming service that entertains over 100 million monthly active users.Tubi offers the world's largest collection of Hollywood movies and TV shows, th...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Principal Machine Learning Engineer

    Principal Machine Learning Engineer

    SAP SE • Palo Alto, CA, United States
    serp_jobs.job_card.full_time +1
    We help the world run better At SAP, we keep it simple : you bring your best to us, and we'll bring out the best in you.We're builders touching over 20 industries and 80% of global commerce, and we ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer, Platform - Berkeley, USA

    Software Engineer, Platform - Berkeley, USA

    Speechify • Berkeley, CA, US
    serp_jobs.job_card.full_time
    The mission of Speechify is to make sure that reading is never a barrier to learning.Over 50 million people use Speechify's text-to-speech products to turn whatever they're reading – ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Software Engineer, Platform - Hayward, USA

    Software Engineer, Platform - Hayward, USA

    Speechify • Hayward, CA, US
    serp_jobs.job_card.full_time
    The mission of Speechify is to make sure that reading is never a barrier to learning.Over 50 million people use Speechify's text-to-speech products to turn whatever they're reading – ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Staff Machine Learning Engineer (ML Platform)

    Staff Machine Learning Engineer (ML Platform)

    EarnIn • Palo Alto, CA, United States
    serp_jobs.job_card.full_time
    Get AI-powered advice on this job and more exclusive features.As one of the first pioneers of earned wage access, our passion at EarnIn is building products that deliver real-time financial flexibi...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Staff Machine Learning Engineer, AI Platform

    Staff Machine Learning Engineer, AI Platform

    General Motors • Sunnyvale, CA, United States
    serp_jobs.job_card.full_time
    Remote : This role is based remotely but if you live within a 50-mile radius of Mountain View, you are expected to report to that location three times a week, at minimum. We are seeking an experience...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Machine Learning Engineer, NLP and multimodal

    Machine Learning Engineer, NLP and multimodal

    NewsBreak • Mountain View, CA, US
    serp_jobs.job_card.full_time
    Founded in 2015, NewsBreak is the Content Intelligence platform shaping the future content economy.With over 40 million monthly active users, our flagship platform delivers highly personalized loca...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Applied AI / ML Engineer

    Applied AI / ML Engineer

    Catalyst Labs • Menlo Park, CA, US
    serp_jobs.job_card.full_time
    Catalyst Labs is a leading talent agency with a specialized vertical in Applied AI, Machine Learning, and Data Science.We stand out as an agency thats deeply embedded in our clients recruitment ope...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Machine Learning Engineer, Platform Architecture

    Machine Learning Engineer, Platform Architecture

    Apple Inc. • Cupertino, CA, United States
    serp_jobs.job_card.full_time
    Machine Learning Engineer, Platform Architecture.Cupertino, California, United States Hardware.At Apple, our Platform Architecture group is responsible for connecting our hardware and software into...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Machine Learning Engineer, GenAI Platform

    Machine Learning Engineer, GenAI Platform

    Tome • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Lightfield is a new kind of CRM.It's a collaborative system for founders to find, understand, and serve customers faster than anything before it. It captures every customer interaction, generates an...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Founding Machine Learning Engineer

    Founding Machine Learning Engineer

    PetsApp • Berkeley, CA, United States
    serp_jobs.job_card.full_time
    About us : We are Lighten, a seed-stage healthcare AI startup that's determined to solve fundamental challenges in driving patient insights from healthcare data. We are well funded by top-tier instit...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Principal Machine Learning Engineer

    Principal Machine Learning Engineer

    Black Ore • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Black Ore is building the leading AI platform for financial services.By combining LLMs, proprietary AI / ML and automation we accelerate core workflows for the industry, allow financial services prof...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Machine Learning Engineer, Relevance

    Senior Machine Learning Engineer, Relevance

    Patreon • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive work and experiences. We offer creators a variety of ways to engage with their fans ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Staff Machine Learning Engineer

    Senior Staff Machine Learning Engineer

    Patreon • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive work and experiences. We offer creators a variety of ways to engage with their fans ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Founding Engineer, Machine Learning

    Founding Engineer, Machine Learning

    Stealth • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Founding Engineer, Machine Learning at Stealth — join to apply for this role.We’re an early-stage stealth startup building a new kind of platform for generative media. Our mission is to enable the f...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted