Machine Learning Inference Engineer

Procession Systems
Hanover, MD, US
Full-time

Job Description

GENERAL DUTIES :

  • NVIDIA Triton Inference Server Expertise : Leverage your in-depth knowledge of NVIDIA Triton to design and manage scalable and high-performance inference pipelines in a production, enterprise system.
  • Model Deployment : Collaborate with data scientists and software engineers to deploy machine learning models, ensuring optimal performance, resource utilization, and cost tracking and savings.
  • Scalability : Architect and implement solutions to scale machine learning inference to handle large workloads efficiently.
  • Performance Optimization : Monitor and fine-tune model inference for optimal speed and resource utilization.
  • Automation : Implement automation tools and processes for model deployment, monitoring, and scaling.
  • Monitoring and Logging : Develop robust monitoring and logging solutions to track model performance, system health, and data quality in real-time.
  • Security : Help implement security best practices to protect machine learning models and data.
  • Documentation : Maintain detailed documentation of machine learning operations processes and best practices.
  • Collaboration : Work closely with a cross-functional Product team to understand business requirements and translate them into technical solutions.
  • Troubleshooting : Provide technical support for debugging and resolving issues related to model deployment and inference.

Required Skills

REQUIRED QUALIFICATIONS :

  • Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
  • Proven experience (3+ years) as a Machine Learning Operations Engineer with a focus on NVIDIA Triton.
  • Experience with other MLOps tools and platforms
  • Strong programming skills in Python.
  • Familiarity with machine learning frameworks like TensorFlow or PyTorch.
  • Experience with GPU hardware and optimization for deep learning workloads.
  • Strong problem-solving skills and the ability to work effectively in a collaborative team environment.
  • Excellent communication skills and the ability to convey technical concepts to both technical and non-technical stakeholders.
  • Solutions Architect Associate credential or other Associate (In Progress acceptable)

CLEARANCE :

Full Scope Polygraph minimum

Desired Skills

DESIRED QUALIFICATIONS :

  • Proficiency in containerization technologies and orchestration tools (e.g., Docker, AWS Fargate, Amazon Elastic Container Service, AWS Elastic Kubernetes Service).
  • Knowledge of DevOps practices and continuous integration / continuous deployment (CI / CD) pipelines
  • Familiarity with the AWS cloud platform
  • Previous experience in the deployment of machine learning models in production environments.

About Procession Systems

About us

30+ days ago
Related jobs
Promoted
eTeam
Baltimore, Maryland

Machine Learning Engineer/Software Engineer. EC2, Sagemaker, CodeDeploy, SNS), and open sources products (as needed) to build infrastructure and workflows that will support enterprise deployment of machine learning models built in Python and R. Prior experience deploying machine learning models with...

Promoted
GIGATEC
Annapolis Junction, Maryland

Experience using a machine-learning framework (. Do you get excited about learning new technologies, problem solving, and influencing outcomes?. The defense community needs an engineering partner who can not only keep up, but bring the technical expertise and passion necessary to solve the new harde...

Promoted
Maverc Technologies
Columbia, Maryland

A talented Machine Learning Engineer to support our AI Center of Excellence! In this role, you and your team will be responsible for the entire lifecycle of machine learning models, from managing and deploying them to troubleshooting any pipeline issues that arise. Manage and deploy machine learning...

Promoted
Inovalon
Bowie, Maryland

Inovalon was founded in 1998 on the belief that technology, and data specifically, would empower the transformation of the entire healthcare ecosystem for the better, improving both outcomes and economics.At Inovalon, we believe that when our customers are successful in their missions, healthcare im...

Condé Nast
Adelphi, Maryland

Deliver and orchestrate machine learning infrastructure within production environments. Years of software development experience designing scalable systems related to machine learning or more general statistical analysis. Experience with machine learning frameworks such as TensorFlow, JAX, PyTorch, ...

Johns Hopkins Applied Physics Laboratory
Laurel, Maryland

Description Do you have a passion for creating machine-learning-based tools that enable the safe development and deployment of autonomous systems? Do you want to make an impact on the future of our nation's defense capabilities? Do you thrive in dynamic and collaborative environments? If so, we are ...

The Pennsylvania State University
Annapolis Junction, Maryland

In this position you will research, develop, and deliver algorithmic, machine learning, and artificial intelligence based approaches to solve complex sponsor problems. Design, develop, and research machine learning systems, models, and schemes. Study, transform, and apply state-of-the-art machine le...

Medifast, Inc
Baltimore, Maryland

The Machine Learning Engineer will play a critical role in enhancing OPTAVIA’s capabilities through the application of advanced machine learning techniques. The ideal candidate will possess a strong background in machine learning, data analysis, and software engineering. Collaborate with cross-funct...

Maverc Technologies
Columbia, Maryland

A talented Machine Learning Engineer to support our AI Center of Excellence! In this role, you and your team will be responsible for the entire lifecycle of machine learning models, from managing and deploying them to troubleshooting any pipeline issues that arise. Machine Learning Engineer . M...

Prodigy One, LLC
Annapolis Junction, Maryland

We are seeking a Java & Python Software Engineer 3 with Machine Learning (ML) & Artificial Intelligence (AI) experience. Experience with machine learning libraries and frameworks such as TensorFlow, PyTorch, scikit-learn, or similar. Confer with system engineers and hardware engineers to derive soft...