Urgent Hiring: MLOps Engineer with AWS (Remote)

Apptad Inc
FL, United States
Remote
Full-time

Job Position : Machine Learning Operations (MLOps) Engineer - AWS (with LLM Focus)

Job Location : Remote

Responsibilities :

LLM-Optimized MLOps Infrastructure : Design and implement MLOps infrastructure on AWS tailored for LLMs, leveraging services like SageMaker, EC2 (with GPU instances), S3, ECS/EKS, Lambda, and more.

LLM Deployment Pipelines : Build and manage CI/CD pipelines specifically for LLM deployment, addressing unique challenges like model size, inference optimization, and versioning.

LLMOps Practices : Implement LLMOps best practices for monitoring model performance, drift detection, prompt management, and feedback loops for continuous improvement.

RESTful API Development : Design and develop RESTful APIs to expose LLM capabilities to other applications and services, ensuring scalability, security, and optimal performance.

Model Optimization : Apply techniques like quantization, distillation, and pruning to optimize LLMs for efficient inference on AWS infrastructure.

Monitoring and Observability : Establish comprehensive monitoring and alerting mechanisms to track LLM performance, latency, resource utilization, and potential biases.

Prompt Engineering and Management : Develop strategies for prompt engineering and management to enhance LLM outputs and ensure consistency and safety.

Collaboration : Work closely with data scientists, researchers, and software engineers to integrate LLMs into production systems effectively.

Cost Optimization : Continuously optimize LLMOps processes and infrastructure for cost-efficiency while maintaining high performance and reliability.

Qualifications :

Experience : 3+ years of experience in MLOps or a related field, with hands-on experience in deploying and managing LLMs.

AWS Expertise : Strong proficiency in AWS services relevant to MLOps and LLMs, including SageMaker, EC2 (with GPU instances), S3, ECS/EKS, Lambda, and API Gateway.

LLM Knowledge : Deep understanding of LLM architectures (e.g., Transformers), training techniques, and inference optimization strategies.

Programming Skills : Proficiency in Python and experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation), REST API frameworks (e.g., Flask, FastAPI), and LLM libraries (e.g., Hugging Face Transformers).

Monitoring : Familiarity with monitoring and logging tools for LLMs, such as Prometheus, Grafana, and CloudWatch.

Containerization : Experience with Docker and container orchestration (e.g., Kubernetes, ECS) for LLM deployment.

Problem Solving : Excellent problem-solving and troubleshooting skills in the context of LLMs and MLOps.

Communication : Strong communication and collaboration skills to effectively work with cross-functional teams.

1 day ago