Search jobs > Palo Alto, CA > Member of technical

Member of Technical Staff, Research Engineer (Inference)

Tbwa Chiat/Day Inc
Palo Alto, California, US
Full-time

About the Role

Please make an application promptly if you are a good match for this role due to high levels of interest.

Member of Technical Staff, Research Engineer (Inference)

As part of Inflection’s commitment to deploying high-performance models for enterprise applications, our inference team ensures that these models run efficiently and effectively in real-world scenarios.

Research engineers in this role focus on optimizing model inference processes, reducing latency, and improving throughput without compromising model performance, ensuring robust deployment in enterprise environments.

This is a good role for you if you :

  • Have experience with deploying and optimizing LLMs for inference, both in cloud and on-prem environments.
  • Are adept at using tools and frameworks for model optimization and acceleration, such as ONNX, TensorRT, or TVM.
  • Enjoy troubleshooting and solving complex problems related to model performance and scaling.
  • Have a deep understanding of the trade-offs involved in model inference, including hardware constraints and real-time processing requirements.
  • Are proficient with PyTorch and familiar with infrastructure management tools like Docker and Kubernetes for deploying inference pipelines.

Employee Pay Disclosures

At Inflection AI, we aim to attract and retain the best employees and compensate them in a way that appropriately and fairly values their individual contributions to the company.

For this role, Inflection AI estimates a starting annual base salary will fall in the range of approximately $200,000 - $350,000.

This estimate can vary based on the factors described above, so the actual starting annual base salary may be above or below this range.

J-18808-Ljbffr

6 days ago
Related jobs
Promoted
QuantumScape
San Jose, California

Metrology Engineer, Member of Technical Staff. As a member of the metrology team, you will help us develop and deploy metrology tools to collect high volume and high velocity data that is predictive of battery and process performance. Professional experience with applying statistical methods to engi...

Promoted
Microsoft
Mountain View, California

Microsoft AI is looking for a talented Growth Android Engineer to help build the next wave of capabilities of our personalized AI assistant, Copilot. The Growth engineering team is responsible for user acquisition, engagement, retention, and feature development that drives rapid growth, while collab...

Promoted
DICE
Mountain View, California

Member of Technical Staff - Android Engineer. Microsoft AI is looking for a talented Android engineer to help build the next wave of capabilities of our personalized AI assistant, Copilot. The Native Engineering team is responsible for leading and building the core experience of Copilot on iOS and A...

Promoted
Microsoft
Mountain View, California

Microsoft AI (MS AI) is seeking experienced Platform Engineers to help build the next wave of capabilities of our personal AI, Copilot. We’re looking for someone who possesses technical prowess, a methodical approach to problem-solving, proficiency in backend technologies, and a mastery of templatin...

Promoted
Ll Oefentherapie
Redwood City, California

As a member of the software engineering division, you will apply basic to intermediate knowledge of software architecture to perform software development tasks associated with developing, debugging, or designing software applications or operating systems according to provided design specifications. ...

Promoted
Contextual AI, Inc.
Mountain View, California

Mentor and provide technical guidance to junior team members, promoting knowledge sharing and professional growth. Work on and do research in state-of-the-art retrieval augmented language models. Collaborate closely with ML researchers, product managers, and designers to understand requirements and ...

Oracle
Santa Clara, California

Principal Engineer is an individual contributor role that requires a proven track record of success and technical depth and maturity as a software developer. Coaching and mentoring other members of the engineering staff. As a Principal Engineer in Oracle Cloud Infrastructure, you will have the oppor...

Integense
San Jose, California

As a Technical Leader, you will work with an experienced team to develop best-in-class ATE solutions by defining the requirements for the next generation of power supply and high-performance pin-electronic integrated circuits. As a fast-growing provider of integrated circuit solutions for the indust...

Governor's Office of Planning and Research
Sacramento County, US

The Governor’s Office of Planning & Research (OPR) is recruiting to fill one (1) limited term, Staff Services Manager III position within the Office of Community Partnerships/Admistrative. Under the general direction of the OCPSC Chief Deputy Director, and with oversight from the Office of Planning ...

Snap Inc.
Palo Alto, California

We’re looking for a Staff Software Engineer at Snap Inc!. We’re deeply committed to the well-being of everyone in our global community, which is why are at the root of everything we do. BS/BA degree in a technical field such as Computer Science or equivalent years of experience. To reflect this, we ...