Talent.com
Machine Learning Systems Platform Engineer

Machine Learning Systems Platform Engineer

Blue SignalSan Francisco, CA, United States
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Confidential Opening : Machine Learning Systems Platform Engineer

Location : San Francisco, CA (Hybrid Preferred)

Overview

A stealth-mode innovator at the forefront of AI infrastructure is seeking a dynamic Machine Learning Systems Platform Engineer to build the backbone of their next-generation ML ecosystem. This team is leading the charge in developing tools and platforms that empower world-class ML teams to experiment, scale, and deploy faster than ever before.

In this key engineering role, you will architect and optimize the systems that make high-performance AI development possible. From training and tuning to inference and monitoring, your work will enable cutting-edge ML initiatives across the organization. You will work closely with ML scientists and engineers to ensure seamless integration of models into production environments.

Key Responsibilities

  • Build and maintain robust infrastructure to support machine learning workloads at scale, including training pipelines, tuning environments, and deployment frameworks.
  • Develop and automate MLOps pipelines for reproducibility, experiment tracking, model versioning, and validation.
  • Optimize cloud and on-prem GPU compute utilization across orchestration platforms.
  • Lead the implementation of tools for model rollback, observability, and system health monitoring.
  • Collaborate with cross-functional teams to ensure reliability, scalability, and maintainability of ML systems.

Qualifications

  • 3+ years of experience in designing and deploying ML infrastructure or production-grade MLOps tools.
  • Fluency in backend development and infrastructure engineering, especially with Python, Go, Bash, Terraform, or Helm.
  • Experience with ML orchestration tools such as Kubeflow, Airflow, MLflow, Ray, or Metaflow.
  • Proficient in containerization and cloud-native technologies, including Docker, Kubernetes, Argo, or managed ML platforms like SageMaker.
  • Deep understanding of cloud environments (AWS, GCP, or Azure) and GPU-accelerated workloads.
  • Preferred Skills

  • Exposure to distributed training techniques (FSDP, DeepSpeed, Horovod).
  • Knowledge of CI / CD strategies for ML and data drift detection methods.
  • Awareness of privacy, compliance, and security practices in ML systems.
  • Prior experience in infrastructure-first or developer-oriented AI organizations.
  • Compensation and Benefits

  • Base salary range : $160,000 to $230,000 DOE
  • Significant equity package and comprehensive benefits
  • Opportunity to work at the core of transformative AI innovation
  • Why Apply?

    This is a rare opportunity to own and shape the ML platform behind AI that will define the next era. If you thrive in system-level problem solving and want to leave your mark on how machine learning is built at scale, this role is for you.

    Apply today to learn more about this confidential opportunity and how you can play a part in the future of AI engineering.

    #J-18808-Ljbffr

    serp_jobs.job_alerts.create_a_job

    Machine Learning Engineer • San Francisco, CA, United States

    Job_description.internal_linking.related_jobs
    • serp_jobs.job_card.promoted
    Systems Engineer - Platform Administrator

    Systems Engineer - Platform Administrator

    VirtualVocationsConcord, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Systems Engineer - Platform Administrator.Key Responsibilities Install, configure, and maintain enterprise servers, networks, and virtualization platforms Manage and s...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
    • serp_jobs.job_card.promoted
    Machine Learning Engineer, GenAI Applied ML

    Machine Learning Engineer, GenAI Applied ML

    Scale AI, Inc.San Francisco, CA, United States
    serp_jobs.job_card.full_time
    At Scale AI, our mission is to accelerate the development of AI applications.For 8 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including : g...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Principal Solutions Engineer

    Principal Solutions Engineer

    VirtualVocationsHayward, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Principal Solution Engineer to join their solutions team as a trusted technical advisor for customers throughout the sales cycle. Key Responsibilities Lead technical dis...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Staff Machine Learning Engineer

    Staff Machine Learning Engineer

    VirtualVocationsConcord, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Staff Machine Learning Engineer to design, build, and deploy advanced AI systems for financial technology applications. Key Responsibilities Develop and fine-tune large ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Senior Lead Machine Learning Engineer

    Senior Lead Machine Learning Engineer

    VirtualVocationsHayward, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Senior Lead Machine Learning Engineer to lead the design and delivery of AI-powered intelligence systems. Responsibilities Design and implement infrastructure for agenti...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Applied Machine Learning Engineer

    Applied Machine Learning Engineer

    VirtualVocationsFremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for an Applied Machine Learning Engineer, Circuit Design - New College Grad 2025.Key Responsibilities Collaborate with a multi-functional team on Pre-silicon and Post Silicon...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Principal Software AI / ML Developer

    Principal Software AI / ML Developer

    VirtualVocationsConcord, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Remote Principal Software AI / ML Developer.Key Responsibilities Architect and implement scalable applications using large language models (LLMs) and develop effective re...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Senior ML Ops Engineer

    Senior ML Ops Engineer

    VirtualVocationsFremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Senior ML Ops Engineer to join their AI infrastructure team.Key Responsibilities Architect, implement, and maintain end-to-end ML pipelines for data ingestion, training...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
    • serp_jobs.job_card.promoted
    Manager of Applied Machine Learning

    Manager of Applied Machine Learning

    VirtualVocationsConcord, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Manager, Applied Machine Learning.Key Responsibilities Lead a team of machine learning scientists and engineers to design, develop, and deliver scalable ML solutions O...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Machine Learning Systems Engineer, RL Engineering

    Machine Learning Systems Engineer, RL Engineering

    Menlo VenturesSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Machine Learning Engineer

    Machine Learning Engineer

    VirtualVocationsFremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Machine Learning Engineer for a 100% remote position.Key Responsibilities Design, build, and maintain machine learning models for production deployment Develop scalabl...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    ML Research Engineer, ML Systems

    ML Research Engineer, ML Systems

    Scale AI, Inc.San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Scale's ML platform (RLXF) team builds our internal distributed framework for large language model training and inference. The platform has been powering MLEs, researchers, data scientists and opera...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Machine Learning Engineer, Planning

    Machine Learning Engineer, Planning

    WaymoMountain View, CA, United States
    serp_jobs.job_card.full_time
    Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
    • serp_jobs.job_card.promoted
    Senior Software Engineer, AI Systems

    Senior Software Engineer, AI Systems

    VirtualVocationsFremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Senior Software Engineer, AI Systems - vLLM and MLPerf.Key Responsibilities Design and implement efficient inference systems for generative AI models Define benchmarki...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Systems Engineer

    Systems Engineer

    VirtualVocationsConcord, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Systems Scripting Engineer to develop automation solutions for IT operations software.Key Responsibilities Build content for IT operations software products through scr...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    ML Ops Engineer

    ML Ops Engineer

    VirtualVocationsFremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for an ML Ops Engineer to join their AI infrastructure team.Key Responsibilities Architect, implement, and maintain end-to-end ML pipelines Automate model training and deplo...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Machine Learning Engineer, Platform Architecture

    Machine Learning Engineer, Platform Architecture

    Apple Inc.Cupertino, CA, United States
    serp_jobs.job_card.full_time
    Machine Learning Engineer, Platform Architecture.Cupertino, California, United States Hardware.At Apple, our Platform Architecture group is responsible for connecting our hardware and software into...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    VirtualVocationsConcord, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Senior Machine Learning Engineer to join their Data Science team.Key Responsibilities : Design, build, and deploy end-to-end machine learning systems and data pipelines ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30