Principal Engineer - AI Infrastructure Abstractions

Diversity Talent Scouts • San Jose, CA, US

Posted 30 days ago

Job type: Full-time

Job Description

As a Principal AI Infrastructure Abstraction Engineer, you will design and implement the foundational systems that make shared AI compute environments scalable, secure, and developer-friendly. Your work will focus on creating abstractions that hide hardware complexity while providing predictable, cloud-native interfaces for AI workloads.

This position bridges infrastructure and applied AI, turning raw GPUs and accelerators into programmable, elastic, and multi-tenant resources for both internal developers and enterprise clients.

Key Responsibilities

Architect abstractions that map logical compute constructs (vGPUs, GPU pools, workload queues) to physical devices.
Build APIs, services, and control planes that expose GPU and accelerator resources with strong isolation and quality-of-service guarantees.
Develop mechanisms for secure GPU sharing, including time-slicing, partitioning, and namespace isolation.
Work with orchestration and scheduling systems to ensure intelligent mapping of resources based on utilization, priority, and network topology.
Define policies for quotas, fair allocation, and resource elasticity in shared environments.
Integrate with AI/ML frameworks (PyTorch, TensorFlow, Triton, etc.) to optimize model training and inference workflows.
Deliver observability and monitoring capabilities that trace resource usage from logical abstractions to hardware.
Partner with platform security teams to strengthen access controls, onboarding processes, and tenant isolation.
Support internal developer adoption of abstraction APIs while maintaining high performance and low overhead.
Contribute to long-term compute platform strategy with a focus on modularity, abstraction, and scale.

Minimum Qualifications

Bachelor’s degree with 15+ years of experience, Master’s with 12+ years, or PhD with 8+ years.

Proven track record building production-grade infrastructure systems, preferably in Go, Python, or C++.

Strong experience with containerization and orchestration platforms (Kubernetes, Docker, KubeVirt).

Background in designing logical abstractions for compute, storage, or networking in multi-tenant systems.

Familiarity with integrating machine learning platforms (e.g., PyTorch, TensorFlow, Triton, MLflow).

Preferred Qualifications

Hands-on experience with GPU sharing, scheduling, or isolation (MIG, MPS, vGPUs, time-slicing, or device plugin models).

Deep knowledge of resource management: quotas, prioritization, fairness, elasticity.

Strong ability to think across hardware/software boundaries and design abstractions that scale.
