Talent.com
Senior AI Research Engineer, Model Inference (Remote)
Senior AI Research Engineer, Model Inference (Remote)Tether Operations Limited • San Francisco, CA, United States
serp_jobs.error_messages.no_longer_accepting
Senior AI Research Engineer, Model Inference (Remote)

Senior AI Research Engineer, Model Inference (Remote)

Tether Operations Limited • San Francisco, CA, United States
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
  • serp_jobs.filters.remote
job_description.job_card.job_description

Overview

Join Tether and Shape the Future of Digital Finance. At Tether, we’re building solutions that empower businesses to integrate reserve-backed tokens across blockchains with transparency and trust in every transaction.

Innovate with Tether. Our product suite features the world’s most trusted stablecoin, USDT, and digital asset tokenization services. We also offer Tether Power, Tether Data, Tether Education, and Tether Evolution to drive sustainable growth, data sharing, digital learning, and continued innovation.

Why join us? We have a global, remote team and a track record of growth and leadership in fintech. If you have excellent English communication skills and want to contribute to the most innovative platform, Tether is the place for you.

About the job : We are looking for an experienced AI Model Engineer with deep expertise in kernel development, model optimization, fine-tuning, and GPU acceleration. The engineer will extend the inference framework to support inference and fine-tuning for language models with a strong focus on mobile and integrated GPU acceleration (Vulkan).

Responsibilities

  • Implement and optimize custom inference and fine-tuning kernels for small and large language models across multiple hardware backends.
  • Implement and optimize full and LoRA fine-tuning for small and large language models across multiple hardware backends.
  • Design and extend datatype and precision support (int, float, mixed precision, ternary QTypes, etc.).
  • Design, customize, and optimize Vulkan compute shaders for quantized operators and fine-tuning workflows.
  • Investigate and resolve GPU acceleration issues on Vulkan and integrated / mobile GPUs.
  • Architect and prepare support for advanced quantization techniques to improve efficiency and memory usage.
  • Debug and optimize GPU operators (e.g., int8, fp16, fp4, ternary).
  • Integrate and validate quantization workflows for training and inference.
  • Conduct evaluation and benchmarking (e.g., perplexity testing, fine-tuned adapter performance).
  • Conduct GPU testing across desktop and mobile devices.
  • Collaborate with research and engineering teams to prototype, benchmark, and scale new model optimization methods.
  • Deliver production-grade, efficient language model deployment for mobile and edge use cases.
  • Work closely with cross-functional teams to integrate optimized serving and inference frameworks into production pipelines designed for edge and on-device applications. Define clear success metrics such as improved real-world performance, low error rates, robust scalability, optimal memory usage and ensure continuous monitoring and iterative refinements for sustained improvements.
  • Proficiency in C++ and GPU kernel programming.
  • Proven expertise in GPU acceleration with Vulkan framework.
  • Strong background in quantization and mixed-precision model optimization.
  • Experience and expertise in Vulkan compute shader development and customization.
  • Familiarity with LoRA fine-tuning and parameter-efficient training methods.
  • Ability to debug GPU-specific performance and stability issues on desktop and mobile devices.
  • Hands-on experience with mobile GPU acceleration and model inference.
  • Familiarity with large language model architectures (e.g., Qwen, Gemma, LLaMA, Falcon, etc.).
  • Experience implementing custom backward operators for fine-tuning.
  • Experience creating and curating custom datasets for style transfer and domain-specific fine-tuning.
  • Demonstrated ability to apply empirical research to overcome challenges in model development.

Important information for candidates

  • Recruitment scams have become increasingly common. To protect yourself, apply only through official channels. All open roles are listed on our careers page : https : / / tether.recruitee.com /
  • Verify the recruiter’s identity. All recruiters have verified LinkedIn profiles. If unsure, check their profile or contact us through our official website.
  • Be cautious of unusual communication methods. We do not conduct interviews over WhatsApp, Telegram, or SMS. All communications are through official company emails and platforms.
  • Double-check email addresses. All communications from us will come from emails ending with @tether.to or @tether.io.
  • We will never request payment or financial details. If someone asks for personal financial information or payment during the hiring process, it is a scam. Please report it immediately.
  • #J-18808-Ljbffr

    serp_jobs.job_alerts.create_a_job

    Ai Research Engineer • San Francisco, CA, United States

    Job_description.internal_linking.related_jobs
    Senior AI Research Engineer, Model Inference (Remote)

    Senior AI Research Engineer, Model Inference (Remote)

    Tether Operations Limited • San Francisco, CA, United States
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    Join Tether and Shape the Future of Digital Finance.At Tether, we’re building solutions that empower businesses to integrate reserve-backed tokens across blockchains with transparency and trust in ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Research Engineer

    Research Engineer

    OpenAI • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    By applying to this role, you will be considered for Research Engineer roles across all teams at OpenAI.As a Research Engineer here, you will be responsible for building AI systems that can perform...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    AI Robotics Research Engineer

    AI Robotics Research Engineer

    Nimble Robotics • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Nimble is a robotics and AI company inventing and scaling autonomous logistics with intelligent robots to enable fast, efficient, and sustainable commerce. We’re developing generalized robot intelli...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    AI Research Engineer, Lead

    AI Research Engineer, Lead

    Menlo Ventures • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    The Technical Lead will drive AI research in one or more of the following areas : structure prediction, protein design, and lead optimization. PhD in AI, Machine Learning, Bioinformatics, or related ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    AI Engineer - Relational Foundation Models & Agentic Systems

    AI Engineer - Relational Foundation Models & Agentic Systems

    Kumo • Mountain View, CA, US
    serp_jobs.job_card.full_time
    Kumo Relational Foundation Model (RFM).This is your chance to be part of that momentum.This is an engineering role at its core, but you’ll also. We’re looking for someone who thrives in ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Lead AI Engineer (FM Hosting, LLM Inference)

    Lead AI Engineer (FM Hosting, LLM Inference)

    Capital One • San Francisco, CA, United States
    serp_jobs.job_card.part_time
    You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing.You want to work on problems that will help change banking for good.Passion for s...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior AI Research Engineer, Model Inference (100% Remote)

    Senior AI Research Engineer, Model Inference (100% Remote)

    Tether Operations Limited • San Francisco, CA, US
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    Join Tether and Shape the Future of Digital Finance.At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from ex...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days
    Research Engineer - Casual AI

    Research Engineer - Casual AI

    Mxv • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    IC4; Estimated salary commensurate with experience.IC5; Estimated salary commensurate with experience.Our formula ensures new hires earn at or above real-time benchmarks. Our generous equity program...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior AI Engineer

    Senior AI Engineer

    Foundation Robotics Labs Inc. • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Our mission is to create advanced robots that can operate in complex environments, reducing human risk in conflict zones and enhancing efficiency in labor-intensive industries.We are on the lookout...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior AI Engineer

    Senior AI Engineer

    Rhythms • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Rhythms is building an AI operating system transforming ordinary teams into extraordinary ones through cutting-edge artificial intelligence. Founded by entrepreneurs behind Chronus (acquired by Priv...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Research Engineer - Multimodal & Video Foundation Model (Remote)

    Senior Research Engineer - Multimodal & Video Foundation Model (Remote)

    Tether Operations Limited • San Francisco, CA, United States
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    Join Tether and Shape the Future of Digital Finance.At Tether, we’re pioneering a global financial revolution with solutions that empower businesses—from exchanges and wallets to payment processors...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    AI Inference Engineer

    AI Inference Engineer

    Perplexity AI • San Francisco, CA, US
    serp_jobs.job_card.full_time
    Perplexity is an AI-powered answer engine founded in December 2022 and growing rapidly as one of the world's leading AI platforms. Perplexity has raised over $1B in venture investment from some ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Research Engineer

    Senior Research Engineer

    Mem0 Official Documentation • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Own the end-to-end lifecycle of memory features—from research to production.You’ll fine-tune models for extraction, updates, consolidation / forgetting, and conflict resolution; turn customer pain po...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_1_day • serp_jobs.job_card.promoted
    ML / AI Research Engineer — Agentic AI Lab (Founding Team)

    ML / AI Research Engineer — Agentic AI Lab (Founding Team)

    Fabrion • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Competitive salary + meaningful equity (founding tier).Backed by 8VC, we\'re building a world-class team to tackle one of the industry’s most critical infrastructure problems.We’re designing the fu...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Generative AI Engineer (Remote)

    Generative AI Engineer (Remote)

    Jobs via Dice • San Francisco, CA, United States
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    Generative AI Engineer (Remote).Generative AI Engineer (Remote).Y Combinator-backed Bio-Tech company is looking for a Senior Data Engineer to join their growing team!. This Jobot Job is hosted by : S...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    AI Research Engineer

    AI Research Engineer

    Menlo Ventures • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Chai Discovery is unlocking new biology with artificial intelligence by building state-of-the-art foundation models.The company is operating in stealth mode by a highly experienced founding team, w...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Applied AI Inference Engineer

    Applied AI Inference Engineer

    Baseten • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Baseten provides the infrastructure, tooling, and expertise needed to bring great AI products to market - fast.Backed by top investors including IVP, Spark Capital, Greylock, and Conviction, we’re ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    AI Research Engineer, Handshake AI

    AI Research Engineer, Handshake AI

    Handshake • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Handshake is building the future of human data for AI.We partner directly with top AI labs to power large language model (LLM) training and evaluation with high-quality, expert-generated data.As AI...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Memory Innovation Engineer, AI and Machine Learning

    Memory Innovation Engineer, AI and Machine Learning

    Micron Technology, Inc. • San Jose, CA, United States
    serp_jobs.job_card.full_time
    Our vision is to transform how the world uses information to enrich life for all.Micron Technology is a world leader in innovating memory and storage solutions that accelerate the transformation of...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior AI / ML Engineer

    Senior AI / ML Engineer

    Chime • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Chime’s AI / ML team is building models, services, and platforms that transform how millions of users manage and grow their financial lives. We are looking for a Senior AI / ML Engineer with deep techni...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted