Talent.com
serp_jobs.error_messages.no_longer_accepting
LLM Inference Engineer (Palo Alto)

LLM Inference Engineer (Palo Alto)

Hippocratic AIPalo Alto, CA, United States
job_description.job_card.variable_hours_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

About Us

Hippocratic AI has developed a safety-focused Large Language Model (LLM) for healthcare. The company believes that a safe LLM can dramatically improve healthcare accessibility and health outcomes in the world by bringing deep healthcare expertise to every human. No other technology has the potential to have this level of global impact on health.

Why Join Our Team

Innovative Mission : We are developing a safe, healthcare-focused large language model (LLM) designed to revolutionize health outcomes on a global scale.

Visionary Leadership : Hippocratic AI was co-founded by CEO Munjal Shah, alongside a group of physicians, hospital administrators, healthcare professionals, and artificial intelligence researchers from leading institutions, including El Camino Health, Johns Hopkins, Stanford, Microsoft, Google, and NVIDIA.

Strategic Investors : We have raised a total of $278 million in funding, backed by top investors such as Andreessen Horowitz, General Catalyst, Kleiner Perkins, NVIDIA's NVentures, Premji Invest, SV Angel, and six health systems.

World-Class Team : Our team is composed of leading experts in healthcare and artificial intelligence, ensuring our technology is safe, effective, and capable of delivering meaningful improvements to healthcare delivery and outcomes.

For more information, visit www.HippocraticAI.com.

We value in-person teamwork and believe the best ideas happen together. Our team is expected to be in the office five days a week in Palo Alto, CA unless explicitly noted otherwise in the job description.

About the Role

We're seeking an experienced LLM Inference Engineer to optimize our large language model (LLM) serving infrastructure. The ideal candidate has :

Extensive hands-on experience with state-of-the-art inference optimization techniques

A track record of deploying efficient, scalable LLM systems in production environments

Key Responsibilities

Design and implement multi-node serving architectures for distributed LLM inference

Optimize multi-LoRA serving systems

Apply advanced quantization techniques (FP4 / FP6) to reduce model footprint while preserving quality

Implement speculative decoding and other latency optimization strategies

Develop disaggregated serving solutions with optimized caching strategies for prefill and decoding phases

Continuously benchmark and improve system performance across various deployment scenarios and GPU types

Required Qualifications

2+ years of experience optimizing LLM inference systems at scale

Proven expertise with distributed serving architectures for large language models

Hands-on experience implementing quantization techniques for transformer models

Strong understanding of modern inference optimization methods, including :

Speculative decoding techniques with draft models

Eagle speculative decoding approaches

Proficiency in Python and C++

Experience with CUDA programming and GPU optimization (familiarity required, expert-level not necessary)

Preferred Qualifications

Contributions to open-source inference frameworks such as vLLM, SGLang, or TensorRT-LLM

Experience with custom CUDA kernels

Track record of deploying inference systems in production environments

Deep understanding of performance optimization systems

Show us what you've built : Tell us about an LLM inference or training project that makes you proud! Whether you've optimized inference pipelines to achieve breakthrough performance, designed innovative training techniques, or built systems that scale to billions of parameters - we want to hear your story.

Open source contributor? Even better! If you've contributed to projects like vllm, sglang, lmdeploy or similar LLM optimization frameworks, we'd love to see your PRs. Your contributions to these communities demonstrate exactly the kind of collaborative innovation we value. Join a team where your expertise won't just be appreciatedit will be celebrated and amplified. Help us shape the future of AI deployment at scale!

References

1. Polaris : A Safety-focused LLM Constellation Architecture for Healthcare, https : / / arxiv.org / abs / 2403.133132. Polaris 2 : https : / / www.hippocraticai.com / polaris23. Personalized Interactions : https : / / www.hippocraticai.com / personalized-interactions4. Human Touch in AI : https : / / www.hippocraticai.com / the-human-touch-in-ai5. Polaris 1 : https : / / www.hippocraticai.com / research / polaris7. Research and clinical blogs : https : / / www.hippocraticai.com / research

serp_jobs.job_alerts.create_a_job

Engineer Palo Alto • Palo Alto, CA, United States

Job_description.internal_linking.related_jobs
  • serp_jobs.job_card.promoted
  • serp_jobs.job_card.new
QA Analyst - Ignition MES

QA Analyst - Ignition MES

VirtualVocationsSan Jose, California, United States
serp_jobs.job_card.full_time
A company is looking for a MES QA Analyst - Ignition Track / Trace.Key Responsibilities : Design, develop, and execute test cases for Ignition MES Track & Trace workflows Validate Jython scripts an...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
  • serp_jobs.job_card.promoted
Salesforce QA Engineer Lead

Salesforce QA Engineer Lead

VirtualVocationsSan Jose, California, United States
serp_jobs.job_card.full_time
A company is looking for a Salesforce QA / Test Engineering Lead.Key Responsibilities : Lead the Salesforce QA team to define test strategies and execute testing to meet stakeholder requirements De...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
  • serp_jobs.job_card.promoted
QA Associate

QA Associate

VirtualVocationsFremont, California, United States
serp_jobs.job_card.full_time
A company is looking for a QA Associate.Key Responsibilities Provides quality assurance support related to documentation processes and systems Responsible for filing and maintenance of controlle...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
  • serp_jobs.job_card.promoted
ATE Test Engineer

ATE Test Engineer

JobotMountain View, CA, US
serp_jobs.job_card.full_time
Help bring cutting-edge products to life in a high-impact, high-growth engineering environment.This Jobot Job is hosted by : Brendan Thomas. Are you a fit? Easy Apply now by clicking the "Apply ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
Test Req

Test Req

VisaUnion City, CA, United States
serp_jobs.job_card.full_time
Visa is a world leader in digital payments, facilitating more than 215 billion payments transactions between consumers, merchants, financial institutions and government entities across more than 20...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
EPLI Project Analyst

EPLI Project Analyst

VirtualVocationsHayward, California, United States
serp_jobs.job_card.full_time
A company is looking for an EPLI Project Analyst to assist with legal project management and client service initiatives.Key Responsibilities Facilitate matter intake and allocation of legal resou...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Test Lead - Automation

Test Lead - Automation

VirtualVocationsHayward, California, United States
serp_jobs.job_card.full_time
A company is looking for a Test Lead - Automation.Key Responsibilities Analyze requirements and acceptance criteria to design test cases and create test automation scripts Develop and implement ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
  • serp_jobs.job_card.promoted
Tech Audit Senior

Tech Audit Senior

WithumSan Francisco, CA, United States
serp_jobs.job_card.full_time
Withum is a place where talent thrives - where who you are matters.It's a place of endless opportunities for growth.A place where entrepreneurial energy plus inclusive teamwork equals exponential r...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
QA Automation Team Lead

QA Automation Team Lead

VirtualVocationsHayward, California, United States
serp_jobs.job_card.full_time
A company is looking for a QA Automation Team Lead for Data Platform.Key Responsibilities Build, guide, and mentor a high-performing engineering team while driving quality assurance and automatio...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
  • serp_jobs.job_card.promoted
Engr, Assoc Test 2

Engr, Assoc Test 2

KLAMilpitas, CA, United States
serp_jobs.job_card.full_time
KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem.Virtually every electronic device in the world is produced using our technologies.No laptop, smartpho...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
  • serp_jobs.job_card.new
QA Analyst III

QA Analyst III

VirtualVocationsSan Jose, California, United States
serp_jobs.job_card.full_time
A company is looking for a QA Analyst III to support the Developer Compliance Operations team in ensuring the integrity and quality of Data Protection Assessment reviews.Key Responsibilities Inde...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
  • serp_jobs.job_card.promoted
Validation Engineer, Cryogenic

Validation Engineer, Cryogenic

PsiQuantumMilpitas, CA, United States
serp_jobs.job_card.full_time
Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
ServiceNow QA Engineer

ServiceNow QA Engineer

VirtualVocationsSan Jose, California, United States
serp_jobs.job_card.full_time
A company is looking for a ServiceNow Automated Test Framework QA Engineer / Tester (Remote).Key Responsibilities Drive ATF QA efforts across ServiceNow implementations and enhancements Develop an...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
  • serp_jobs.job_card.new
Epic PB Certified Analyst

Epic PB Certified Analyst

VirtualVocationsOakland, California, United States
serp_jobs.job_card.full_time
A company is looking for an Epic PB SME.Key Responsibilities Conduct optimization and assessment engagements for Epic PB Lead stakeholder sessions with executive-level communication Coordinate ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
  • serp_jobs.job_card.promoted
QA Engineer (VN155AP2025)

QA Engineer (VN155AP2025)

40HRS, Inc.Fremont, CA, US
serp_jobs.job_card.full_time
Quality Engineer Fremont, CA (on-site) Promote and Enforce company ISO standard to ensure that targets are achieved Collaborate with all departments and quality team to ensure all staffs are workin...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
  • serp_jobs.job_card.new
TPA Analyst

TPA Analyst

VirtualVocationsSan Jose, California, United States
serp_jobs.job_card.full_time
A company is looking for a TPA Analyst to assist in plan administration and support a team environment.Key Responsibilities Assist Associates and Consultants with Defined Contribution Balance For...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
  • serp_jobs.job_card.promoted
UX Researcher for Training

UX Researcher for Training

VirtualVocationsHayward, California, United States
serp_jobs.job_card.full_time
A company is looking for a UX Researcher (Training & Education) to design and refine training courses related to prevention of sexual assault and other harmful behaviors in the military.Key Respons...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
  • serp_jobs.job_card.promoted
  • serp_jobs.job_card.new
Tennessee Licensed Automation Tester

Tennessee Licensed Automation Tester

VirtualVocationsSan Francisco, California, United States
serp_jobs.job_card.full_time
A company is looking for an Automation Tester to support government projects focused on health and social services.Key Responsibilities Build automation frameworks and write / executing UI and API ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
  • serp_jobs.job_card.promoted
Data Integrity Specialist

Data Integrity Specialist

VirtualVocationsSan Jose, California, United States
serp_jobs.job_card.full_time
A company is looking for a Data Integrity Specialist.Key Responsibilities Coordinate and facilitate chart corrections and data quality issues across multiple departments Provide customer service...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
  • serp_jobs.job_card.new
Epic PB Analyst

Epic PB Analyst

VirtualVocationsSan Francisco, California, United States
serp_jobs.job_card.temporary
A company is looking for an Epic PB Analyst for a 6+ month contract position.Key Responsibilities Mentor and guide client builders in environments with limited system access Provide hands-on bui...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours