Talent.com
Principal Engineer - AI Infrastructure Abstractions
Principal Engineer - AI Infrastructure AbstractionsDiversity Talent Scouts • San Jose, CA, US
Principal Engineer - AI Infrastructure Abstractions

Principal Engineer - AI Infrastructure Abstractions

Diversity Talent Scouts • San Jose, CA, US
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Job Description

Job Description

As a Principal AI Infrastructure Abstraction Engineer , you will design and implement the foundational systems that make shared AI compute environments scalable, secure, and developer-friendly. Your work will focus on creating abstractions that hide hardware complexity while providing predictable, cloud-native interfaces for AI workloads.

This position bridges infrastructure and applied AI—turning raw GPUs and accelerators into programmable, elastic, and multi-tenant resources for both internal developers and enterprise clients.

Key Responsibilities

  • Architect abstractions that map logical compute constructs (vGPUs, GPU pools, workload queues) to physical devices.
  • Build APIs, services, and control planes that expose GPU and accelerator resources with strong isolation and quality-of-service guarantees.
  • Develop mechanisms for secure GPU sharing, including time-slicing, partitioning, and namespace isolation.
  • Work with orchestration and scheduling systems to ensure intelligent mapping of resources based on utilization, priority, and network topology.
  • Define policies for quotas, fair allocation, and resource elasticity in shared environments.
  • Integrate with AI / ML frameworks (PyTorch, TensorFlow, Triton, etc.) to optimize model training and inference workflows.
  • Deliver observability and monitoring capabilities that trace resource usage from logical abstractions to hardware.
  • Partner with platform security teams to strengthen access controls, onboarding processes, and tenant isolation.
  • Support internal developer adoption of abstraction APIs while maintaining high performance and low overhead.
  • Contribute to long-term compute platform strategy with a focus on modularity, abstraction, and scale.

Minimum Qualifications

  • Bachelor’s degree with 15+ years of experience, Master’s with 12+ years, or PhD with 8+ years.
  • Proven track record building production-grade infrastructure systems, preferably in Go, Python, or C++.
  • Strong experience with containerization and orchestration platforms (Kubernetes, Docker, KubeVirt).
  • Background in designing logical abstractions for compute, storage, or networking in multi-tenant systems.
  • Familiarity with integrating with machine learning platforms (e.g., PyTorch, TensorFlow, Triton, MLFlow).
  • Preferred Qualifications

  • Hands-on experience with GPU sharing, scheduling, or isolation (MIG, MPS, vGPUs, time-slicing, or device plugin models).
  • Deep knowledge of resource management : quotas, prioritization, fairness, elasticity.
  • Strong ability to think across hardware / software boundaries and design abstractions that scale.
  • serp_jobs.job_alerts.create_a_job

    Principal Engineer Ai • San Jose, CA, US

    Job_description.internal_linking.related_jobs
    Senior AI Infrastructure Engineer

    Senior AI Infrastructure Engineer

    VirtualVocations • Santa Clara, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Senior AI Infrastructure Engineer, Cloud Partnerships - DGX Cloud.Key Responsibilities Architect unified systems for integrating infrastructure provider maintenance eve...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Principal AI Engineer

    Principal AI Engineer

    Synopsys • Mountain View, CA, United States
    serp_jobs.job_card.full_time
    You are a passionate and driven individual with a degree in Computer Science, Computer Engineering, or Electrical Engineering. With a strong foundation in Artificial Intelligence algorithms and expe...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Principal Engineer, Cyber Threat Intelligence

    Principal Engineer, Cyber Threat Intelligence

    VirtualVocations • Fremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Principal Engineer - Cyber Threat Intelligence.Key Responsibilities Lead advanced research and analysis of cyber adversary tactics and procedures Produce threat intell...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Principal Solutions Architect

    Principal Solutions Architect

    VirtualVocations • Fremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Principal Solutions Architect.Key Responsibilities Conduct discovery and definition sessions to understand customer goals and challenges Lead solution workshops and so...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    AI Infrastructure Engineer, Model Serving Platform

    AI Infrastructure Engineer, Model Serving Platform

    Scale AI, Inc. • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving of LLMs. Our platform powers cutting-edge research and product...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Principal Sales Engineer

    Principal Sales Engineer

    VirtualVocations • Fremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Principal Sales Engineer to drive technology exploratory stages of the sales process and serve as a technical advisor. Key Responsibilities Drive the technology explorat...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Platform Engineer II

    Platform Engineer II

    VirtualVocations • Fremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Platform Engineer II - Enterprise Storage Support Engineer.Key Responsibilities Designs, engineers, and implements systems infrastructure Proactively manages and monit...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Principal Security Engineer

    Principal Security Engineer

    VirtualVocations • Fremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Principal Security Engineer to lead information security initiatives and collaborate with development and operational teams. Key Responsibilities Identify security threa...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Principal Engineer Cyber Countermeasures

    Principal Engineer Cyber Countermeasures

    VirtualVocations • Fremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Principal Engineer - Cyber Countermeasures.Key Responsibilities Lead the design and implementation of cyber countermeasures against advanced adversary tactics Develop ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    AI Systems Engineer

    AI Systems Engineer

    VirtualVocations • Fremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for an Engineering & AI Systems Engineer to design and implement internal tools that enhance operational efficiency. Key Responsibilities Build and deploy internal tools to ad...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Principal Data Engineer - AI

    Principal Data Engineer - AI

    VirtualVocations • Oakland, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Principal Data Engineer - AI (REMOTE).Key Responsibilities Define and drive the technical vision for data platforms supporting AI-powered features Lead the design and ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Principal Core Network Engineer

    Principal Core Network Engineer

    VirtualVocations • Hayward, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Principal Core Network Engineer to provide oversight and technical leadership for its networking team. Key Responsibilities Develop and maintain high-level design and st...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_1_day • serp_jobs.job_card.promoted
    Principal Verification Engineer

    Principal Verification Engineer

    OSI Engineering • Menlo Park, CA, US
    serp_jobs.job_card.full_time
    A leading chip and silicon IP provider is seeking a talented Principal Verification Engineer to join its Memory Interconnect Design team. In this full-time hybrid role, you’ll work alongside world-c...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    AWS Certified Infrastructure Engineer

    AWS Certified Infrastructure Engineer

    VirtualVocations • Fremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for an Infrastructure Systems Engineer to manage infrastructure for federal government and private sector clients. Key Responsibilities Design, deploy, and manage AWS environm...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_1_day • serp_jobs.job_card.promoted
    Principal Engineer

    Principal Engineer

    VirtualVocations • Oakland, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Principal Engineer to provide architectural and technical leadership across its product platform. Key Responsibilities Drive holistic software architecture across teams ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Principal Software Architect

    Principal Software Architect

    VirtualVocations • Fremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Principal Software Architect - Back End.Key Responsibilities Own back-end architecture strategy for enterprise software applications, including APIs and microservices ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Principal Engineer, AI Architect

    Principal Engineer, AI Architect

    VirtualVocations • San Francisco, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Principal Engineer, AI Architect to design and implement innovative Generative AI solutions.Key Responsibilities Architect and build next-generation Conversational AI p...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Principal Engineer - IAM

    Principal Engineer - IAM

    VirtualVocations • Oakland, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Principal Engineer - Identity Management (IAM & Golang Backend).Key Responsibilities Architect and design the next-generation identity platform for authentication and a...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Principal Data Engineer

    Principal Data Engineer

    VirtualVocations • San Francisco, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Principal Data Engineer to join their Data Engineering team.Key Responsibilities Design, implement, and maintain data models, pipelines, and architecture for healthcare...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Platform & Infrastructure Engineer

    Platform & Infrastructure Engineer

    Mindsdb • San Francisco, CA, US
    serp_jobs.job_card.full_time
    Job description ABOUT USMindsDB is a fast-growing AI startup headquartered in San Francisco, California.MindsDB is an AI Analytics solution that connects to diverse data sources and applications th...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted