Senior AI Research Engineer, Model Inference (Remote)

Tether Operations Limited • San Francisco, CA, United States

Job type: Full-time, Remote

Overview

Join Tether and Shape the Future of Digital Finance. At Tether, we’re building solutions that empower businesses to integrate reserve-backed tokens across blockchains with transparency and trust in every transaction.

Innovate with Tether. Our product suite features the world’s most trusted stablecoin, USDT, and digital asset tokenization services. We also offer Tether Power, Tether Data, Tether Education, and Tether Evolution to drive sustainable growth, data sharing, digital learning, and continued innovation.

Why join us? We have a global, remote team and a track record of growth and leadership in fintech. If you have excellent English communication skills and want to contribute to the most innovative platform, Tether is the place for you.

About the job: We are looking for an experienced AI Model Engineer with deep expertise in kernel development, model optimization, fine-tuning, and GPU acceleration. The engineer will extend the inference framework to support inference and fine-tuning for language models, with a strong focus on mobile and integrated GPU acceleration (Vulkan).

Responsibilities

Implement and optimize custom inference and fine-tuning kernels for small and large language models across multiple hardware backends.
Implement and optimize full and LoRA fine-tuning for small and large language models across multiple hardware backends.
Design and extend datatype and precision support (int, float, mixed precision, ternary QTypes, etc.).
Design, customize, and optimize Vulkan compute shaders for quantized operators and fine-tuning workflows.
Investigate and resolve GPU acceleration issues on Vulkan and integrated/mobile GPUs.
Architect and prepare support for advanced quantization techniques to improve efficiency and memory usage.
Debug and optimize GPU operators (e.g., int8, fp16, fp4, ternary).
Integrate and validate quantization workflows for training and inference.
Conduct evaluation and benchmarking (e.g., perplexity testing, fine-tuned adapter performance).
Conduct GPU testing across desktop and mobile devices.
Collaborate with research and engineering teams to prototype, benchmark, and scale new model optimization methods.
Deliver production-grade, efficient language model deployment for mobile and edge use cases.
Work closely with cross-functional teams to integrate optimized serving and inference frameworks into production pipelines designed for edge and on-device applications. Define clear success metrics, such as improved real-world performance, low error rates, robust scalability, and optimal memory usage, and ensure continuous monitoring and iterative refinement for sustained improvement.

Qualifications

Proficiency in C++ and GPU kernel programming.
Proven expertise in GPU acceleration with the Vulkan framework.
Strong background in quantization and mixed-precision model optimization.
Experience and expertise in Vulkan compute shader development and customization.
Familiarity with LoRA fine-tuning and parameter-efficient training methods.
Ability to debug GPU-specific performance and stability issues on desktop and mobile devices.
Hands-on experience with mobile GPU acceleration and model inference.
Familiarity with large language model architectures (e.g., Qwen, Gemma, LLaMA, Falcon, etc.).
Experience implementing custom backward operators for fine-tuning.
Experience creating and curating custom datasets for style transfer and domain-specific fine-tuning.
Demonstrated ability to apply empirical research to overcome challenges in model development.

Important information for candidates

Recruitment scams have become increasingly common. To protect yourself, apply only through official channels. All open roles are listed on our careers page: https://tether.recruitee.com/

Verify the recruiter’s identity. All recruiters have verified LinkedIn profiles. If unsure, check their profile or contact us through our official website.

Be cautious of unusual communication methods. We do not conduct interviews over WhatsApp, Telegram, or SMS. All communications are through official company emails and platforms.

Double-check email addresses. All communications from us will come from emails ending with @tether.to or @tether.io.

We will never request payment or financial details. If someone asks for personal financial information or payment during the hiring process, it is a scam. Please report it immediately.

