Search jobs > Palo Alto, CA > Machine learning engineer

Machine Learning Engineer (Inference)

Acceler8 Talent
Palo Alto, CA, US
Full-time

Elevate AI Performance : Join Us as a Research Engineer in Model Inference!

What We're Building :

As we embark on a new phase of growth, our focus is on collaborating with commercial partners to adapt and fine-tune our state-of-the-art AI models for their unique business needs.

With a strong track record in developing and deploying cutting-edge models in consumer-facing applications, we’re now channeling our expertise into optimizing these models for real-world performance.

Join our team and be part of an innovative organization dedicated to pushing the boundaries of AI.

About the Role :

As a Member of Technical Staff, Research Engineer specializing in inference, you will be at the forefront of optimizing our advanced AI models for efficient and effective deployment in enterprise environments.

Your work will involve fine-tuning the inference processes, reducing latency, and enhancing throughput, all while maintaining the highest standards of model performance.

You'll play a critical role in ensuring that our AI solutions run smoothly and reliably in real-world scenarios.

This Role is Ideal for You If You :

  • Have hands-on experience deploying and optimizing large language models (LLMs) for inference in both cloud and on-premise environments.
  • Are proficient with model optimization tools and frameworks such as ONNX, TensorRT, or TVM.
  • Enjoy troubleshooting and resolving complex issues related to model performance and scalability.
  • Understand the trade-offs involved in model inference, including hardware constraints and real-time processing needs.
  • Are skilled in PyTorch and familiar with using Docker and Kubernetes to manage and deploy inference pipelines.

Why Join Us?

  • Autonomy & Impact : We believe in empowering individual contributors. Here, you’ll have the space and resources to lead projects, make impactful decisions, and see the direct results of your work.
  • Collaborative Culture : Our team thrives on teamwork, mutual respect, and a shared commitment to excellence. We encourage open dialogue, constructive challenges, and continuous learning.
  • Data-Driven Decisions : User feedback and performance metrics are at the core of our AI development process, guiding our priorities and ensuring we deliver the best solutions.
  • Work-Life Balance : We value your well-being and offer unlimited paid time off, flexible parental leave, and a supportive environment to help you recharge and maintain a healthy work-life balance.

Diversity & Inclusion :

We are dedicated to creating AI solutions that serve everyone. We welcome candidates from all backgrounds and are committed to building a diverse and inclusive team.

If you’re passionate about optimizing AI models for real-world applications and want to be part of a team that values innovation and collaboration, we’d love to hear from you! Apply today and showcase your best work, whether through open source contributions, personal projects, or a cover letter highlighting your proudest achievements.

16 days ago
Related jobs
Promoted
VirtualVocations
Santa Clara, California

A company is looking for a Machine Learning Engineer Apprentice to contribute to their ML engineering team. ...

Promoted
MM International
Sunnyvale, California

Role: Machine Learning Engineer. ...

Promoted
MindSource
San Jose, California

Lead a team of machine learning engineers to develop state-of-the-art deep learning solutions for analysis of high-resolution, high-velocity image and measurement data, leading to improved understanding of device performance and improved yield. Job Title: Lead Principal Machine Learning Engineer. Re...

Promoted
AppLovin
Palo Alto, California

We are looking for a seasoned Machine Learning Engineer with a specialization in ML infrastructure and deep learning architecture. Collaboration: Work closely with our talented team of machine learning engineers, data scientists, and software engineers to integrate your solutions into our platform s...

Promoted
TikTok
San Jose, California

Responsible for the development of state-of-the-art applied machine learning projects. BS/MS degree in Computer Science, Computer Engineering, or other relevant majors. ...

Promoted
Amazon
Cupertino, California

The team offers hands-on data science and coding services to our most strategic customer opportunities to launch their training and inference workloads on AWS purpose built ML silicon offerings. You will work directly with customer data scientists and ML engineering teams and write code to have the ...

Promoted
TikTok
San Jose, California

Proficiency or publications in modern machine learning theories and applications such as deep learning, transfer/multi-task learning, reinforcement learning, time series or graph algorithms. Bachelor or degrees above in Computer Science, in computer science, Computer Engineering, or other relevant, ...

Promoted
Idaho Occupational Therapy Associaton
Sunnyvale, California

Designs and develops scalable solutions that leverage machine learning and deep learning models to meet enterprise requirements. Works closely with data scientists and data engineers to develop machine learning algorithms. Understands and translates business and functional needs into machine learnin...

Promoted
TikTok
San Jose, California

Proficiency in modern machine learning theories and applications, including ensemble trees, deep neural networks, transfer/multi-task learning, reinforcement learning, graph theory, and unsupervised learning. We achieve our mission by a) developing state-of-art Machine Learning (ML) solutions to pre...

Promoted
WeRide.ai
San Jose, California

Machine Learning, Deep Learning or High-Performance Computing. PhD in Electrical Engineering, Computer Science/Engineering or a related field. Excellent knowledge of theory and practice of machine learning. Knowledge of common machine learning frameworks. ...