Job Description
Job Title : AI / ML Engineer
Location : Nashville, TN
Duration : 6 months
Day to Day :
The first few weeks will be spent investigating the concept design team's processes, to understand the inner workings of the tasks that must be completed to move the project lifecycle along. After that, the engineer will work on a single tool within the product (front-end and back-end familiarity needed) that automatically executes the tasks investigated earlier.
Goal for contract : Look upstream and downstream at other teams to evaluate their processes and figure out how to incorporate those into the AI tool. Add documentation for processes outside of what they develop.
The successful engineer in this role will :
- Understand how commodity servers, operating systems and networks function, perform and scale.
- Possess superb troubleshooting, project management and problem analysis skills.
- Drive technical innovation and efficiency in infrastructure operations via automation.
- Design server monitoring and management solutions using automation and self-repair.
- Create processes that enhance operational workflow and provide positive customer impact.
- Dive deep to resolve problems at their root, looking for failure patterns amenable to long-term solutions via simplification and automation.
- Avoid re-inventing the wheel and prefer appropriately simple, repeatable solutions over more complex and failure-prone ones.
- Recognize and adopt best practices in documentation, testing, security, operational support at scale, and efficient use of resources.
- Develop appropriate metrics to demonstrate improvements in operational efficiency.
Skills :
- 5+ years in AI / ML development
- 3+ years implementing ML / DL algorithms in Python (PyTorch, Keras, scikit-learn)
- PyTorch for advanced research and model customization
- Keras for rapid prototyping and TensorFlow integration
- 2+ years building Generative AI applications (LLM-driven solutions in production)
- 1+ years deploying production AI agents at scale
- Strong background in AWS cloud-native solutions for ML / AI (SageMaker, Bedrock, Lambda, ECS, EKS)
- Experience with industrial systems integration and protocols (OPC-UA, Modbus, MQTT, REST APIs)
Must haves :
- Full stack development
- AI development
- Documentation and writing to senior leaders
Technical Environment :
- Programming & Scripting : Python (primary), Bash, SQL
- ML / AI Frameworks : PyTorch, TensorFlow, Keras, scikit-learn
- Agent Frameworks : LangChain, AutoGPT, CrewAI
- AWS Services :
  - Compute : EC2, Lambda, ECS / EKS
  - Data : S3, Glue, Athena, Redshift, DynamoDB, Timestream
  - AI / ML : SageMaker, Bedrock, Kendra, OpenSearch Vector DB
  - Messaging / Streaming : Kinesis, SQS / SNS, EventBridge
  - Infra & Security : IAM, VPC, CloudFormation / CDK, AWS-SDK, CloudWatch, Step Functions
- Databases : Redshift, PostgreSQL, MySQL, DynamoDB, Timestream
- Visualization : Matplotlib, Plotly, Grafana, QuickSight
- CI / CD & DevOps : GitHub / GitLab CI, Docker, Terraform / CDK
- Industrial / Edge : OPC-UA, MQTT, REST APIs for IoT / industrial data
Story Behind the Need – Business Group & Key Projects