Talent.com
Senior AI Research Engineer, Model Inference (Remote)
Senior AI Research Engineer, Model Inference (Remote)Tether Operations Limited • New York, NY, US
Senior AI Research Engineer, Model Inference (Remote)

Senior AI Research Engineer, Model Inference (Remote)

Tether Operations Limited • New York, NY, US
job_description.job_card.variable_hours_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
  • serp_jobs.filters.remote
job_description.job_card.job_description

Overview

Join Tether and Shape the Future of Digital Finance. At Tether, we're building solutions that empower businesses to integrate reserve-backed tokens across blockchains with transparency and trust in every transaction.

Innovate with Tether. Our product suite features the world's most trusted stablecoin, USDT, and digital asset tokenization services. We also offer Tether Power, Tether Data, Tether Education, and Tether Evolution to drive sustainable growth, data sharing, digital learning, and continued innovation.

Why join us? We have a global, remote team and a track record of growth and leadership in fintech. If you have excellent English communication skills and want to contribute to the most innovative platform, Tether is the place for you.

About the job : We are looking for an experienced AI Model Engineer with deep expertise in kernel development, model optimization, fine-tuning, and GPU acceleration. The engineer will extend the inference framework to support inference and fine-tuning for language models with a strong focus on mobile and integrated GPU acceleration (Vulkan).

Responsibilities

  • Implement and optimize custom inference and fine-tuning kernels for small and large language models across multiple hardware backends.
  • Implement and optimize full and LoRA fine-tuning for small and large language models across multiple hardware backends.
  • Design and extend datatype and precision support (int, float, mixed precision, ternary QTypes, etc.).
  • Design, customize, and optimize Vulkan compute shaders for quantized operators and fine-tuning workflows.
  • Investigate and resolve GPU acceleration issues on Vulkan and integrated / mobile GPUs.
  • Architect and prepare support for advanced quantization techniques to improve efficiency and memory usage.
  • Debug and optimize GPU operators (e.g., int8, fp16, fp4, ternary).
  • Integrate and validate quantization workflows for training and inference.
  • Conduct evaluation and benchmarking (e.g., perplexity testing, fine-tuned adapter performance).
  • Conduct GPU testing across desktop and mobile devices.
  • Collaborate with research and engineering teams to prototype, benchmark, and scale new model optimization methods.
  • Deliver production-grade, efficient language model deployment for mobile and edge use cases.
  • Work closely with cross-functional teams to integrate optimized serving and inference frameworks into production pipelines designed for edge and on-device applications. Define clear success metrics such as improved real-world performance, low error rates, robust scalability, optimal memory usage and ensure continuous monitoring and iterative refinements for sustained improvements.
  • Proficiency in C++ and GPU kernel programming.
  • Proven expertise in GPU acceleration with Vulkan framework.
  • Strong background in quantization and mixed-precision model optimization.
  • Experience and expertise in Vulkan compute shader development and customization.
  • Familiarity with LoRA fine-tuning and parameter-efficient training methods.
  • Ability to debug GPU-specific performance and stability issues on desktop and mobile devices.
  • Hands-on experience with mobile GPU acceleration and model inference.
  • Familiarity with large language model architectures (e.g., Qwen, Gemma, LLaMA, Falcon, etc.).
  • Experience implementing custom backward operators for fine-tuning.
  • Experience creating and curating custom datasets for style transfer and domain-specific fine-tuning.
  • Demonstrated ability to apply empirical research to overcome challenges in model development.

Important information for candidates

  • Recruitment scams have become increasingly common. To protect yourself, apply only through official channels. All open roles are listed on our careers page : https : / / tether.recruitee.com /
  • Verify the recruiter's identity. All recruiters have verified LinkedIn profiles. If unsure, check their profile or contact us through our official website.
  • Be cautious of unusual communication methods. We do not conduct interviews over WhatsApp, Telegram, or SMS. All communications are through official company emails and platforms.
  • Double-check email addresses. All communications from us will come from emails ending with @tether.to or @tether.io.
  • We will never request payment or financial details. If someone asks for personal financial information or payment during the hiring process, it is a scam. Please report it immediately.
  • J-18808-Ljbffr

    serp_jobs.job_alerts.create_a_job

    Ai Research Engineer • New York, NY, US

    Job_description.internal_linking.related_jobs
    Senior AI Research Engineer, Model Inference (100% Remote)

    Senior AI Research Engineer, Model Inference (100% Remote)

    Tether Operations Limited • New York, NY, US
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    Join Tether and Shape the Future of Digital Finance.At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from ex...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days
    Sr Director Analyst, AI Innovation & Emerging AI Trends (Remote United States)

    Sr Director Analyst, AI Innovation & Emerging AI Trends (Remote United States)

    Gartner • Stamford, CT, United States
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    Gartner Analysts are industry thought leaders who create must-have research, market predictions and best practices for a broad range of world-leading organizations. A Senior Director serves as a lea...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior NLP Research Engineer - Artificial Intelligence

    Senior NLP Research Engineer - Artificial Intelligence

    Neural Information Processing Systems • New York, NY, US
    serp_jobs.job_card.full_time
    Description & Requirements.Bloomberg's Engineering AI department has 350+ AI practitioners building highly sought after products and features that often require novel innovations.We are investi...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Research Engineer - Wearable Agents AI

    Research Engineer - Wearable Agents AI

    Meta • New York, NY, US
    serp_jobs.job_card.full_time
    We are seeking a Research Engineer to join the Wearable Agents AI team.This role will focus on creating advanced machine learning models and systems that power the Wearable AI Assistant's memory an...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Senior Research Engineer

    Senior Research Engineer

    AssemblyAI • New York, NY, US
    serp_jobs.job_card.full_time
    At AssemblyAI, we're building at the forefront of Speech AI, creating powerful models for speech-to-text and speech understanding available through a straightforward API. With more than 200,000 deve...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    AI Research Engineer - Datadog AI Research (DAIR)

    AI Research Engineer - Datadog AI Research (DAIR)

    Datadog • New York, NY, US
    serp_jobs.job_card.full_time
    AI Research Engineer – Datadog AI Research (DAIR).As a research engineer on our team, you will partner with research scientists to turn research ideas into working systems; building the data, tooli...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Research Engineer, Wearables AI

    Research Engineer, Wearables AI

    Meta • New York, NY, US
    serp_jobs.job_card.full_time
    Reality Labs at Meta is seeking Research Engineers to develop and enhance on-device AI / ML models for our Wearables devices. This role is pivotal in advancing our capabilities in interpreting and pro...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Founding Audio AI Research Engineer

    Founding Audio AI Research Engineer

    David AI • New York, NY, US
    serp_jobs.job_card.full_time
    David AI is the first audio data research company.We bring an R&D approach to data–developing datasets with the same rigor AI labs bring to models. Speech is versatile, accessible, and.To unlock...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Machine Learning Research Lead, Security & Policy Research Lab

    Machine Learning Research Lead, Security & Policy Research Lab

    Scale AI, Inc. • New York, NY, United States
    serp_jobs.job_card.full_time
    As the leading data and evaluation partner for frontier AI companies, Scale plays an integral role in understanding the capabilities and safeguarding AI models and systems.Building on this expertis...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Research Engineer, GenAI

    Research Engineer, GenAI

    Kiddom • New York, NY, US
    serp_jobs.job_card.full_time +1
    Get AI-powered advice on this job and more exclusive features.This range is provided by Kiddom.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.K...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Research Engineer, Language - Wearables Agents AI

    Research Engineer, Language - Wearables Agents AI

    The Rundown AI, Inc. • New York, NY, US
    serp_jobs.job_card.full_time
    Reality Labs at Meta is building products that make it easier for people to connect with the ones they love most, enjoy top-notch, wire-free VR, and push the future of computing platforms.We are a ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Research Engineer (Robotics) - World Modelling, FAIR

    Research Engineer (Robotics) - World Modelling, FAIR

    The Rundown AI, Inc. • New York, NY, US
    serp_jobs.job_card.full_time
    Meta is seeking a Research Engineer to join the Fundamental AI Research (FAIR) team, one of the top industrial AI research organizations in the world. Come join FAIR's efforts to build world models,...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Senior AI Engineer, Applications Integrations

    Senior AI Engineer, Applications Integrations

    White & Case LLP • New York, NY, United States
    serp_jobs.job_card.full_time
    Senior AI Engineer, Applications Integrations.White & Case is an elite global law firm serving leading companies, financial institutions and governments worldwide. Our long history as an internation...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior GenAI Platform Engineer - Artificial Intelligence

    Senior GenAI Platform Engineer - Artificial Intelligence

    Bloomberg • New York, NY, US
    serp_jobs.job_card.full_time
    Senior GenAI Platform Engineer - Artificial Intelligence.Description & Requirements.Bloomberg's Engineering AI department has 350+ AI practitioners building highly sought after products and fea...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior / Staff AI Research Engineer

    Senior / Staff AI Research Engineer

    Hume AI • New York, NY, US
    serp_jobs.job_card.full_time
    Hume AI is seeking talented researchers and engineers interested in working with our AI research team to build state-of-the-art speech-language models (SLMs). Our new SLM training method—reinforceme...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Senior MLOps Engineer - Artificial Intelligence

    Senior MLOps Engineer - Artificial Intelligence

    Bloomberg L.P. • New York, NY, US
    serp_jobs.job_card.full_time
    Senior MLOps Engineer - Artificial Intelligence.Description & Requirements.Bloomberg's Engineering AI department has 350+ AI practitioners building highly sought after products and features tha...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    AI Research Engineer, Handshake AI

    AI Research Engineer, Handshake AI

    Handshake • New York, NY, US
    serp_jobs.job_card.full_time
    AI Research Engineer, Handshake AI.AI Research Engineer, Handshake AI.Handshake is building the future of human data for AI. We partner directly with top AI labs to power large language model (LLM) ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Research Engineer, Language - Wearables Voice AI

    Research Engineer, Language - Wearables Voice AI

    Meta • New York, NY, US
    serp_jobs.job_card.full_time
    As a Research Engineer in Wearables Voice AI, you will have the opportunity to perform ground-breaking research to advance the device-driven AI Assistant effort at Meta – e.RayBan Smart Glasses and...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new