Talent.com
Reinforcement Learning Engineer

Code Metal, Boston, MA, US
Posted 30 days ago
Job type: Full-time

Job Description

At Code Metal AI, you’ll be part of a world-class team with talent from MIT, OpenAI and other top companies, focused on pioneering work in large language models (LLMs) and code generation. Our projects directly involve leading chip manufacturers, applying advanced AI to solve meaningful, practical challenges with real-world impact.

This role bridges two critical areas:

Production

  • Build and maintain robust distributed training systems using PyTorch (2+ years of experience required).
  • Design and implement scalable data curation and quality assurance pipelines to ensure top-tier training datasets.
  • Develop orchestration tools that manage complex workflows across large-scale AI model training and evaluation.

Research

  • Drive innovation by developing evaluation frameworks and reinforcement learning solutions, including recent advancements in Reinforcement Learning from Human Feedback (RLHF).
  • Engage with frontier research through open-source projects and potential publications, applying RLHF to Large Language Models (LLMs), ideally focusing on code generation tasks.

Requirements

  • 2+ years of experience in distributed training, preferably with PyTorch.
  • Strong background in reinforcement learning, with recent RLHF experience highly preferred.
  • Proven ability to build data curation and quality assurance pipelines.
  • Experience with evaluation framework development.
  • Ideally, experience across both data pipeline and orchestration sides.
  • Eligible for TS/SCI clearance.

Nice to have

  • Contributions to open-source AI or ML projects.
  • Published work or demonstrable research experience in related fields.
  • Hands-on experience applying RLHF to LLMs, especially for code generation.
  • Experience with large-scale synthetic data generation.

Benefits

  • Health care plan with 100% premium coverage, including medical, dental, and vision.
  • 401(k) with 5% matching.
  • Paid Time Off (Uncapped Vacation, plus Sick & Public Holidays).
  • Flexible hybrid work arrangement.
  • Relocation assistance for qualifying employees.