Machine Learning Engineer

Lumicity

CA, United States

Full-time

About the Company

This company develops generative video models that allow users to create animated pictures with ease, incorporating their own existing audio or utilizing text-to-speech models.

Having raised over $10M and generating significant excitement with their first two foundational model releases, they are expanding their team in San Francisco.

About the Role

We're looking for passionate Machine Learning Engineers to join our team and help build cutting-edge systems for large-scale data collection, GPU training, and AI model inference optimization.

If you have deep expertise in model quantization, parallel inference, and accelerating diffusion models, and you're excited about deploying state-of-the-art ML models in the cloud, this is the perfect opportunity for you.

What You'll Do :

Build and scale distributed data collection and curation systems to support large-scale model training and inference.
Optimize GPU-based training pipelines for efficiency and speed, focusing on large-scale model deployment.
Accelerate inference for diffusion models and transformers, leveraging techniques like model quantization and parallel inference.
Optimize and implement CUDA kernels , Triton , and TensorRT to maximize inference performance.
Develop and maintain cloud-based infrastructure (AWS, Oracle) using Kubernetes and Terraform for scalable model deployment.
Architect REST APIs for distributed systems, ensuring high performance and low-latency responses.

What You Bring :

5+ years of experience in Python or Golang, with a strong emphasis on performance optimization.
Expertise in model quantization , parallel inference , and deploying ML models in production.
Hands-on experience with PyTorch , TensorRT , Triton , and CUDA kernels for accelerating model inference, especially in large-scale applications.
Strong background with Kubernetes , Docker , and NVIDIA hardware (GPUs, Tensor Cores).
Experience scaling pipelines in AWS (SQS, Kafka), implementing infrastructure as code using tools like Terraform.
A startup mindset ability to move fast, iterate quickly, and build impactful systems in a fast-evolving space.
Passion for deploying AI technologies at scale and driving innovation in generative models.

Send your resume today and join us in building the next generation of AI-driven video models!

13 hours ago

Related jobs

Promoted

Machine Learning Engineer Graduate (eCommerce Recommendation) - 2025 Start (BS/MA)

TikTok

San Jose, California

Experience in applied machine learning, familiar with one or more of the algorithms such as Collaborative Filtering, Matrix Factorization, Factorization Machines, Word2vec, Logistic Regression, Gradient Boosting Trees, Deep Neural Networks, Wide and Deep etc. Work in a team to conduct cutting-edge r...

Promoted

AI & Machine Learning Engineer - Senior - Consulting - Location OPEN

Westlake Village, California

AI/Machine Learning Engineer, Senior ConsultantThe opportunityOur Artificial Intelligence and Data team helps apply cutting edge technology and techniques to bring solutions to our clients. To qualify for the role you must haveBachelor's degree and 3-6 years of full-time working experience in AI and...

Promoted

Senior Machine Learning Engineer, Shop Ads

TikTok

San Jose, California

We are looking for strong Machine Learning Engineers who are excited to grow their business understanding, build highly scalable and reliable software, and partner across disciplines with global teams in pursuit of excellence. Apply state-of-the-art machine learning techniques to optimize advertiser...

Promoted

Senior Machine Learning Engineer

Curai Health

San Francisco, California

Machine Learning Engineering at Curai. Product sense" or intuition on what to build: You will need to be able to translate product intuition into data-driven hypotheses that result in impactful machine learning/engineering solutions. As the pioneer in deploying machine learning into clinical workflo...

Promoted

Senior Software Engineer, Machine Learning Platform

Discord

San Francisco, California

We sit at the intersection of machine learning engineers (MLEs), core infrastructure, and ML consumers to provide tools, capabilities, and services that make machine learning easy, safe, and widely accessible. The Machine Learning Platform (MLP) at Discord is responsible for the end-to-end model lif...

Promoted

AIML - Machine Learning Engineer, Siri Perception

Apple Inc.

San Francisco, California

As a Machine Learning Engineer, you will play a crucial role in designing, developing, and implementing machine learning algorithms to power this feature. As a Machine Learning Engineer, you will be responsible for developing and applying machine learning techniques to improve Siri's ability to unde...

Promoted

Senior Machine Learning Engineer

Capital One National Association

Richmond, California

As a Capital One Machine Learning Engineer (MLE), you'll be part of an Agile team dedicated to productionizing machine learning applications and systems at scale. You’ll focus on machine learning architectural design, develop and review model and application code, and ensure high availability and pe...

Promoted

Machine learning Engineer

Robotics Prcocess Automation, LLC

Menlo Park, California

Job Title: Machine learning Engineer. Specialized Area: Machine learning. Basic understanding of machine learning networks such as ResNet, ResNeXt. Basic understanding of machine learning operators such as convolutions and fully connected layers. ...

Promoted

Machine Learning Engineer

Starsky Robotics

San Francisco, California

Architect and train machine learning models for object detection and tracking. Background in computer science/electrical engineering/Mathematics. ...

Promoted

Sr. Machine Learning Engineer, AI Infrastructure, Autonomy

Rivian

Palo Alto, California

You will work with a team of talented software engineers, machine learning engineers and research scientists to push the boundary of state-of-art machine learning models which will enable the next-generation E2E solution of autonomous driving. We are looking for a full-time Machine Learning Engineer...