Job Description
Job Description
About the Opportunity
We operate in the Artificial Intelligence and Software Development sector, delivering scalable AI-powered applications and automation for enterprise customers. This role sits at the intersection of generative AI, NLP, and cloud-native engineering, focused on moving research-grade models into reliable production services.
Intellivon is hiring a remote AI Engineer based in the USA to design, build, and deploy advanced AI solutions that power product features and internal automation.
Role Responsibilities
- Design, develop, and maintain production-grade AI services and microservices that power LLM and NLP features.
- Fine-tune, benchmark, and optimize transformer models for cost, latency, and accuracy across inference workloads.
- Implement robust inference pipelines and orchestration (batch and real-time) with observability and monitoring.
- Integrate AI capabilities into product APIs and client-facing endpoints; own API contracts and versioning.
- Containerize models and services, build CI / CD for model packaging, and collaborate on deployment to cloud environments.
- Partner with data scientists and product teams to translate research prototypes into scalable, well-tested features.
Skills QualificationsMust-Have
PythonPyTorchTensorFlowTransformers (Hugging Face)LangChainDockerAWSRESTful APIsPreferred
MLOps tooling (model registry, CI for models)KubernetesExperience with prompt engineering and retrieval-augmented generation (RAG)Benefits Culture Highlights
Remote-first, USA-based role with flexible work hours and asynchronous collaboration.Opportunity to work on cutting-edge generative AI products and influence model design and deployment standards.Fast-paced, learning-driven environment with mentorship and frequent cross-functional ownership.