Talent.com
Software Engineer (AI Performance)

Software Engineer (AI Performance)

Gimlet Labs, IncSan Francisco, CA, United States
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Gimlet Labs is building the foundation for the next generation of AI applications. As generative AI workloads rapidly scale, inference efficiency is becoming the critical bottleneck. Gimlet is redefining AI inference from the ground up, combining cutting-edge research with an integrated hardware-software stack that delivers breakthrough performance, efficiency, and model quality. Gimlet pairs its inference stack with a seamless developer experience, allowing users to deploy, manage, and monitor AI workloads from frameworks like PyTorch and LangChain at production scale in seconds.

Gimlet is spun out of a Stanford research project under Professors Zain Asgar and Sachin Katti. The founding team has deep experience across AI, distributed systems, and hardware with previous successful exits.

Gimlet Labs is seeking a Software Engineer focused on AI Performance. You will be researching and implementing techniques to drive performance and quality optimizations across the latest AI models. You will implement techniques such as quantization, KV caching, and FlashAttention to enable inference efficiency. You will design parallelism strategies to distribute data and workloads across compute nodes at production scale. You will dive deep into GPU code and kernel optimizations to accelerate AI workloads.

Responsibilities

  • Evaluating and implementing cutting-edge AI research for model performance and efficiency
  • Architecting infrastructure for distributed AI workloads across both the software stack and GPU kernel layers
  • Profiling, benchmarking, and analyzing system performance, identifying bottlenecks and optimization opportunities in execution runtimes targeting various hardware systems

Qualifications

  • Bachelor’s degree in computer science, engineering, applied mathematics or comparable area of study
  • Experience with performance optimization
  • Preferred Qualifications

  • Graduate degree in computer science, engineering, applied mathematics or comparable area of study
  • Familiarity with compilers and compiler frameworks such as MLIR
  • Experience with PyTorch, TensorFlow, vLLM, ONNX and other AI frameworks
  • Software development experience with Python, C++, and CUDA
  • #J-18808-Ljbffr

    serp_jobs.job_alerts.create_a_job

    Software Engineer Ai • San Francisco, CA, United States

    Job_description.internal_linking.related_jobs
    • serp_jobs.job_card.promoted
    AI Automation Engineer

    AI Automation Engineer

    VirtualVocationsHayward, California, United States
    serp_jobs.job_card.full_time
    A company is looking for an AI & Automation Engineer to join their IT & Security Enterprise Applications team.Key Responsibilities Implement AI-powered solutions and integrate AI with enterprise ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Senior AI / ML Engineer

    Senior AI / ML Engineer

    VirtualVocationsHayward, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Staff Software Engineer - AI / ML, GenAI.Key Responsibilities Design, build, and deploy AI / ML models and solutions using Python and other scripting languages Develop and...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    AI Engineer Lead

    AI Engineer Lead

    VirtualVocationsHayward, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Lead or Principal AI Engineer.Key Responsibilities Lead the design, architecture, and execution of AI platforms for autonomous decision-making and intelligent automatio...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    AI-Native Engineer

    AI-Native Engineer

    VirtualVocationsFremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for an AI-Native Engineer (Backend / Full-stack).Key Responsibilities Develop and scale the web app and Word add-in for contract negotiation automation Integrate LLMs and desi...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    AI Architect

    AI Architect

    VirtualVocationsHayward, California, United States
    serp_jobs.job_card.full_time
    A company is looking for an AI Architect to lead artificial intelligence and machine learning practices focused on industrial applications. Key Responsibilities Design and oversee the implementati...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Principal Software AI / ML Developer

    Principal Software AI / ML Developer

    VirtualVocationsConcord, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Remote Principal Software AI / ML Developer.Key Responsibilities Architect and implement scalable applications using large language models (LLMs) and develop effective re...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Senior Software Engineer, AI Systems

    Senior Software Engineer, AI Systems

    VirtualVocationsSan Francisco, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Senior Software Engineer, AI Systems - vLLM and MLPerf.Key Responsibilities Design and implement efficient inference systems for generative AI models Define benchmarki...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Software Engineer, Enterprise AI

    Software Engineer, Enterprise AI

    Scale AI, Inc.San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Scale GP (Scale Generative AI Platform) is an enterprise-grade Generative AI platform that provides APIs for knowledge retrieval, inference, evaluation, and more. We are looking for a strong enginee...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Full-Stack AI Engineer

    Full-Stack AI Engineer

    VirtualVocationsHayward, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Full-Stack AI Engineer.Key Responsibilities Prototyping, building, and launching AI experiences for large retailers and enterprises Partnering with PM, sales, and clie...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
    • serp_jobs.job_card.promoted
    Lead Enterprise AI Workshops

    Lead Enterprise AI Workshops

    VirtualVocationsFremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Lead for Enterprise AI Workshops.Key Responsibilities Run and grow the enterprise workshop business, ensuring revenue and delivery targets are met Deliver high-impact ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
    • serp_jobs.job_card.promoted
    AI Software Engineer

    AI Software Engineer

    VirtualVocationsFremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for an AI Software Engineer to lead the development of AI and ML solutions.Key Responsibilities Contribute to prototype efforts and deliver production-ready features with hig...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Principal Engineer, AI

    Principal Engineer, AI

    VirtualVocationsSanta Clara, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Principal Engineer, AI Agents.Key Responsibilities Architect foundational AI strategy and drive capabilities for agentic AI and advanced analytics Design scalable AI-d...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Principal Engineer, AI Architecture

    Principal Engineer, AI Architecture

    VirtualVocationsHayward, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Principal Engineer, AI Agents.Key Responsibilities Architect foundational AI strategy and capabilities for agentic AI and advanced analytics Design scalable AI-driven ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
    • serp_jobs.job_card.promoted
    AI Product Engineer

    AI Product Engineer

    VirtualVocationsFremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for an AI Product Engineer (LLM-first Full-Stack).Key Responsibilities Design and ship end-to-end product features across frontend, backend, and AI services Utilize AI assis...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    AI Engineer

    AI Engineer

    VirtualVocationsFremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Lead AI Engineer to develop and deploy machine learning models that impact the automotive industry. Key Responsibilities Develop and deploy multiple AI models into produ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Sr. Software Engineer- AI / LLM

    Sr. Software Engineer- AI / LLM

    SupermicroSan Jose, CA, United States
    serp_jobs.job_card.full_time
    Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Automation Engineer

    Automation Engineer

    VirtualVocationsFremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for an Automation Engineer (Platform Support) to innovate and streamline infrastructure operations through automation. Key Responsibilities Manage and maintain Terraform and A...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Senior Software Engineer, Full-Stack - Enterprise Gen AI

    Senior Software Engineer, Full-Stack - Enterprise Gen AI

    Scale AI, Inc.San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Senior Software Engineer, Full-Stack - Enterprise Gen AI.Scale GP (Scale Generative AI Platform) is an enterprise-grade AI platform providing APIs for knowledge retrieval, inference, evaluation, an...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Senior AI Engineer

    Senior AI Engineer

    VirtualVocationsFremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Senior AI Agent Engineer (Go).Key Responsibilities Design and develop AI agents using Go programming language Collaborate with cross-functional teams to integrate AI s...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Staff AI Engineer

    Staff AI Engineer

    VirtualVocationsConcord, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Staff AI Engineer to develop advanced AI-powered mental health tools.Key Responsibilities Design, train, fine-tune, and evaluate machine learning and large language mod...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30