Machine Learning Inference Engineer

Procession Systems
Hanover, MD, US
Full-time

Job Description

GENERAL DUTIES:

  • NVIDIA Triton Inference Server Expertise: Apply in-depth knowledge of NVIDIA Triton to design and manage scalable, high-performance inference pipelines in a production enterprise system.
  • Model Deployment: Collaborate with data scientists and software engineers to deploy machine learning models, ensuring optimal performance and resource utilization while tracking and reducing cost.
  • Scalability: Architect and implement solutions that scale machine learning inference to handle large workloads efficiently.
  • Performance Optimization: Monitor and fine-tune model inference for optimal speed and resource utilization.
  • Automation: Implement automation tools and processes for model deployment, monitoring, and scaling.
  • Monitoring and Logging: Develop robust monitoring and logging solutions to track model performance, system health, and data quality in real time.
  • Security: Help implement security best practices to protect machine learning models and data.
  • Documentation: Maintain detailed documentation of machine learning operations processes and best practices.
  • Collaboration: Work closely with a cross-functional Product team to understand business requirements and translate them into technical solutions.
  • Troubleshooting: Provide technical support for debugging and resolving issues related to model deployment and inference.

Required Skills

REQUIRED QUALIFICATIONS:

  • Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
  • Proven experience (3+ years) as a Machine Learning Operations Engineer with a focus on NVIDIA Triton.
  • Experience with other MLOps tools and platforms.
  • Strong programming skills in Python.
  • Familiarity with machine learning frameworks like TensorFlow or PyTorch.
  • Experience with GPU hardware and optimization for deep learning workloads.
  • Strong problem-solving skills and the ability to work effectively in a collaborative team environment.
  • Excellent communication skills and the ability to convey technical concepts to both technical and non-technical stakeholders.
  • Solutions Architect Associate credential or another Associate-level credential (in progress acceptable).

CLEARANCE:

Full Scope Polygraph minimum

Desired Skills

DESIRED QUALIFICATIONS:

  • Proficiency in containerization technologies and orchestration tools (e.g., Docker, AWS Fargate, Amazon Elastic Container Service, Amazon Elastic Kubernetes Service).
  • Knowledge of DevOps practices and continuous integration/continuous deployment (CI/CD) pipelines.
  • Familiarity with the AWS cloud platform.
  • Previous experience in the deployment of machine learning models in production environments.
