Talent.com
Cantina is hiring : Inference Engineer, Video AI in San Francisco

Cantina is hiring : Inference Engineer, Video AI in San Francisco

MediabistroSan Francisco, CA, United States
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

A bit about Cantina :

Cantina, founded by Sean Parker, is a new social platform with the most advanced AI character creator. Build, share, and interact with AI bots and your friends directly in the Cantina or across the internet.

Cantina bots are lifelike, social creatures, capable of interacting wherever humans go on the internet. Recreate yourself using powerful AI, imagine someone new, or choose from thousands of existing characters. Bots are a new media type that offer a way for creators to share infinitely scalable and personalized content experiences combined with seamless group chat across voice, video, and text.

If you're excited about the potential AI has to shape human creativity and social interactions, join us in building the future!

A bit about the role : We're looking for an Inference Engineer who specializes in productionizing and hosting video AI models at scale. You'll be responsible for taking cutting-edge neural networks from research to production, building robust inference infrastructure, and optimizing model performance for real-time applications. This role focuses on the deployment and serving of large video models.

As an Inference Engineer, you will :

  • Deploy video AI models to production - Take research models and build production-ready inference endpoints with APIs, ensuring efficient operation across cloud infrastructure.
  • Maintain and optimize inference systems - Debug complex model serving issues, optimize latency performance, monitor system health, and ensure 99.9% uptime for AI-powered features.
  • Implement model optimizations - Work with neural network architectures including diffusion networks, VAEs, and transformers. Apply streaming optimizations and understand video model architectures to implement effective performance improvements.
  • Manage inference infrastructure - Leverage containerization with Docker, cloud storage solutions like S3, and cluster computing to build scalable model serving infrastructure.
  • Collaborate with research teams - Work closely with AI researchers to understand model requirements, architectural constraints, and optimization opportunities for new video generation models.

A bit about you :

  • 2+ years of ML engineering experience with focus on model inference and deployment
  • Strong understanding of neural network architectures , particularly diffusion networks, VAEs, and transformer models
  • Experience with video and image models - Understanding of how video / image generation models work, their architectures, and optimization strategies specific to video processing
  • Multi-GPU inference expertise - Experience running model components across multiple GPUs, implementing parallel processing strategies for large models
  • Production model hosting experience - Track record of deploying and maintaining ML models in production environments, including streaming and real-time inference
  • Experience with containerization (Docker), AWS, and cluster computing environments
  • Familiarity with machine learning frameworks (PyTorch, TensorFlow)
  • Experience with inference platforms and model serving solutions
  • Technical Stack You'll Work With :

  • Cloud : AWS (S3, DynamoDB), Kubernetes clusters
  • ML Infrastructure : Model serving platforms, Docker
  • Languages : Python
  • Frameworks : PyTorch, TensorFlow
  • Models : Video generation models, diffusion networks, VAEs, transformers
  • Optimization : Multi-GPU inference, real-time processing techniques

    Pay Equity :

    In compliance with Pay Transparency Laws, the base salary range for this role is between $175,000-$225,000 for those located in the San Francisco Bay Area, New York City and Seattle, WA. When determining compensation, a number of factors will be considered, including skills, experience, job scope, location, and competitive compensation market data.

    Benefits :

  • Health Care - 99% of premiums for medical, vision, dental are fully paid for by Cantina, plus One Medical membership.
  • Monthly Wellness Stipend - $500 / month to use on whatever you'd like!
  • Rest and Recharge - 15 PTO days per year, 10 sick days, all Federal holidays, and 2 floating holidays.
  • 401(K) - Eligible to participate on day one of employment.
  • Parental Leave & Fertility Support
  • Competitive Salary & Equity
  • Lunch and snacks provided for in-office employees.
  • WFH equipment provided for full-time hybrid / remote employees.
  • In Summary : Cantina is a new social platform with the most advanced AI character creator . Cantina bots are lifelike, social creatures, capable of interacting wherever humans go on the internet . We're looking for an Inference Engineer who specializes in productionizing and hosting video AI models at scale .

    En Español : Cantina, fundada por Sean Parker, es una nueva plataforma social con el creador de personajes AI más avanzado. Construye, comparta e interactúa con bots de IA y sus amigos directamente en la Cantina o a través de Internet. Los bots de Cantina son seres reales, sociales, capaces de interactuar dondequiera que vayan los humanos en internet. Recreate usando inteligencia artificial poderosa, imagina a alguien nuevo o elija entre miles de caracteres existentes. Mantener y optimizar los sistemas de inferencia - Desarmar problemas complejos del modelo que sirve, optimizar el rendimiento de latencia, monitorear la salud del sistema y garantizar un 99.9% de tiempo de actividad para las características impulsadas por IA. Implementar optimizaciones de modelos - Trabajar con arquitecturas de redes neuronales incluyendo redes de difusión, VAEs y transformadores. Aplicar optimizas de transmisión y entender las arquitectura de modelos de video para implementar mejoras efectivas en el desempeño. Un poco sobre usted : 2+ años de experiencia en ingeniería ML con enfoque en la inferencia y despliegue de modelos Conocimiento sólido de las arquitecturas de redes neurales, particularmente redes de difusión, VAEs y modelos transformadores Experiencia con los modelos de video e imagen - Comprensión de cómo funcionan los models de generación de vídeo / imagen, sus arquitectonas y estrategias de optimización específicas para el procesamiento de videos La experiencia de inferencia multi-GPU - Experimentación ejecutando componentes del modelo a través de múltiples GPUs, implementación de estrategia de procesamiento paralelo para modelos grandes Experiencia de alojamiento de modelos de producción Al determinar la compensación, se tendrán en cuenta una serie de factores, incluidas las habilidades, experiencia, alcance del trabajo, ubicación y datos sobre el mercado de compensación competitiva. Beneficios : atención médica - 99% de las primas para médicos, visión, odontología son pagadas completamente por Cantina, más un miembro médico. Beca mensual de bienestar - $ 500 / mes para usar lo que quiera! Descanso y recarga - 15 días PTO al año, 10 días de enfermedad, todos los feriados federales y 2 vacaciones flotantes.

    serp_jobs.job_alerts.create_a_job

    Hiring Engineer Video • San Francisco, CA, United States

    Job_description.internal_linking.related_jobs
    • serp_jobs.job_card.promoted
    Senior Staff Engineer, AI

    Senior Staff Engineer, AI

    VirtualVocationsConcord, California, United States
    serp_jobs.job_card.full_time
    Staff / Staff Engineer, AI Developer Experience.Key Responsibilities Build and maintain developer workflow tools to enhance the overall development experience Lead and mentor other engineers, ensu...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    Zoom Video Communications is hiring : Video AI Engineer in San Jose

    Zoom Video Communications is hiring : Video AI Engineer in San Jose

    MediabistroSan Jose, CA, United States
    serp_jobs.job_card.full_time
    As a Video AI Engineer, you'll enhance video codec standards to improve real-time video quality and performance in Zoom products. Work across our stack, developing software ranging from Web Server t...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Senior AI Research Engineer

    Senior AI Research Engineer

    VirtualVocationsConcord, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Senior Generative AI Research Engineer to advance the field of AI computing.Key Responsibilities Design and post-train foundation models for real-world applications Co...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    Zoom is hiring : Video AI Engineer in San Jose

    Zoom is hiring : Video AI Engineer in San Jose

    MediabistroSan Jose, CA, United States
    serp_jobs.job_card.full_time
    As a Video AI Engineer, you'll enhance video codec standards to improve real-time video quality and performance in Zoom products. Work across our stack, developing software ranging from Web Server t...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    AI Researcher for Conversational AI

    AI Researcher for Conversational AI

    VirtualVocationsConcord, California, United States
    serp_jobs.job_card.full_time
    A company is looking for an AI Researcher specializing in Large Language Models.Key Responsibilities Conduct research on large language modeling and adaptation for Conversational Avatars Develop...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
    • serp_jobs.job_card.promoted
    Agentic AI Engineer

    Agentic AI Engineer

    VirtualVocationsHayward, California, United States
    serp_jobs.job_card.full_time
    A company is looking for an Agent Forward Engineer to lead the design and deployment of AI agents and software for application development and migration. Key Responsibilities Design and implement ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
    • serp_jobs.job_card.promoted
    AI Engineer

    AI Engineer

    VirtualVocationsConcord, California, United States
    serp_jobs.job_card.full_time
    A company is looking for an AI Engineer, Principal.Key Responsibilities Collaborate with data scientists and ML engineers to containerize, deploy, and monitor AI / ML models Design, build, and man...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    AIVideo is hiring : AIVideo.com seeks pioneering AI Engineer to reinvent video pr

    AIVideo is hiring : AIVideo.com seeks pioneering AI Engineer to reinvent video pr

    MediabistroSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    We have collected the best dataset in the world for video editing.Our popular web-based video editor produces thousands of user actions per minute. The goal is to use that user data to power a human...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    AI Engineer III

    AI Engineer III

    VirtualVocationsOakland, California, United States
    serp_jobs.job_card.full_time
    A company is looking for an AI Engineer III to lead the design and deployment of advanced AI systems.Key Responsibilities Define architecture, standards, and evaluation strategies for AI systems ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    AI Inference Engineer

    AI Inference Engineer

    Perplexity AISan Francisco, CA, US
    serp_jobs.job_card.full_time
    Perplexity is an AI-powered answer engine founded in December 2022 and growing rapidly as one of the world's leading AI platforms. Perplexity has raised over $1B in venture investment from some ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Principal AI Engineer, Intelligent Sensors

    Principal AI Engineer, Intelligent Sensors

    1010 Analog Devices Inc.Rio Robles, CA, United States
    serp_jobs.job_card.full_time +1
    NASDAQ : ADI ) is a global semiconductor leader that bridges the physical and digital worlds to enable breakthroughs at the Intelligent Edge. ADI combines analog, digital, and software technologie...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Applied AI Inference Engineer

    Applied AI Inference Engineer

    BasetenSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Baseten provides the infrastructure, tooling, and expertise needed to bring great AI products to market - fast.Backed by top investors including IVP, Spark Capital, Greylock, and Conviction, we’re ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Applied AI Engineer

    Applied AI Engineer

    VirtualVocationsConcord, California, United States
    serp_jobs.job_card.full_time
    A company is looking for an Applied AI Engineer to design and deploy AI systems for construction workflows.Key Responsibilities Build and deploy agentic AI systems that automate construction work...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Senior Software Engineer, AI / ML Computer Vision, YouTube

    Senior Software Engineer, AI / ML Computer Vision, YouTube

    Google Inc.Mountain View, CA, United States
    serp_jobs.job_card.full_time
    Senior Software Engineer, AI / ML Computer Vision, YouTube – Mountain View, CA, USA.Bachelor’s degree or equivalent practical experience. Computer Vision (image classification and processing, object d...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Machine Learning Video Engineer

    Machine Learning Video Engineer

    Apple Inc.Cupertino, CA, United States
    serp_jobs.job_card.full_time
    Cupertino, California, United States Hardware.Want to work on cutting edge technology that keeps the customer front and center? The Video Engineering group at Apple is responsible for creating the ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Principal AI / ML Engineer, Security AI

    Principal AI / ML Engineer, Security AI

    Cisco Systems, Inc.San Jose, CA, United States
    serp_jobs.job_card.full_time
    The Cisco Security AI team delivers AI products and platform for all Cisco secure products and portfolios so businesses around the world can defend against threats and safeguard the most vital aspe...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Lead AI Engineer

    Lead AI Engineer

    VirtualVocationsFremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Lead AI Engineer to architect and deploy machine learning and AI systems across its payments platform. Key Responsibilities Lead the design, development, and deployment ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Staff Machine Learning Engineer- Video AI / Computer Vision

    Staff Machine Learning Engineer- Video AI / Computer Vision

    Warner Bros. DiscoverySan Francisco, CA, United States
    serp_jobs.job_card.full_time
    When we say, “the stuff dreams are made of,” we’re not just referring to the world of wizards, dragons and superheroes, or even to the wonders of Planet Earth. Behind WBD’s vast portfolio of iconic ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30