Talent.com
AI Inference Engineer

Perplexity AI • San Francisco, CA, US
Job type: Full-time

Job Description

Perplexity is an AI-powered answer engine founded in December 2022 and growing rapidly as one of the world's leading AI platforms. Perplexity has raised over $1B in venture investment from some of the world's most visionary and successful leaders, including Elad Gil, Daniel Gross, Jeff Bezos, Accel, IVP, NEA, NVIDIA, Samsung, and many more. Our objective is to build accurate, trustworthy AI that powers decision-making for people and assistive AI wherever decisions are being made. Throughout human history, change and innovation have always been driven by curious people. Today, curious people use Perplexity to answer more than 780 million queries every month, a number that keeps growing for one simple reason: everyone can be curious.

We are looking for an AI Inference Engineer to join our growing team. Our current stack is Python, Rust, C++, PyTorch, Triton, CUDA, and Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.

Responsibilities

  • Develop APIs for AI inference that will be used by both internal and external customers
  • Benchmark and address bottlenecks throughout our inference stack
  • Improve the reliability and observability of our systems and respond to system outages
  • Explore novel research and implement LLM inference optimizations

Qualifications

  • Experience with ML systems and deep learning frameworks (e.g. PyTorch, TensorFlow, ONNX)
  • Familiarity with common LLM architectures and inference optimization techniques (e.g. continuous batching, quantization)
  • Understanding of GPU architectures or experience with GPU kernel programming using CUDA

Compensation

The cash compensation range for this role is $190,000 - $250,000. Final offer amounts are determined by multiple factors, including experience and expertise, and may vary from the amounts listed above.

Equity: In addition to the base salary, equity may be part of the total compensation package.

Benefits: Comprehensive health, dental, and vision insurance for you and your dependents, as well as a 401(k) plan.
