LLM Engineer Offshore India

Insight Global
Chicago, IL, United States
Full-time

We are seeking a skilled LLM Engineer proficient in Python programming and experienced in developing, deploying, and optimizing large language models (LLMs).

The ideal candidate will have hands-on experience with the FastAPI or Flask frameworks, LangChain implementation, and building Retrieval-Augmented Generation (RAG) pipelines.

You will play a key role in integrating cutting-edge AI technologies to solve complex business problems, focusing on vector stores and retrievers while deploying scalable solutions on AWS.

Key Responsibilities

1. Python Development:

a. Design, develop, and maintain scalable web services using FastAPI or Flask frameworks.

b. Write efficient, reusable, and modular Python code to support API-driven LLM applications.

2. LangChain & Supporting Frameworks:

a. Implement LangChain to build custom pipelines for document indexing, retrieval, and summarization.

b. Integrate LangChain's RAG capabilities with other components, such as vector stores and retrievers, to support real-time querying and document processing.

3. RAG Pipelines:

a. Architect and deploy Retrieval-Augmented Generation (RAG) systems for chatbots, knowledge systems, and other generative AI applications.

b. Optimize RAG systems for speed, accuracy, and scalability across multiple use cases.

4. Vector Stores & Retrievers:

a. Work with vector databases like Pinecone, Chroma, FAISS, or Milvus to store and manage embeddings.

b. Implement retrievers and re-rankers to improve query efficiency, ensuring high-quality and relevant outputs for users.

5. AWS Cloud Deployment:

a. Deploy and manage LLM-based applications on AWS, leveraging services such as Lambda, EC2, S3, EKS, and RDS.

b. Ensure the scalability, availability, and reliability of deployed applications.

6. Dashboards and Monitoring (Optional):

a. Create monitoring dashboards using tools like Grafana or Tableau for real-time system monitoring, analytics, and performance insights.

7. Experimentation with Generative AI:

a. Research and integrate the latest advancements in generative AI technologies.

b. Experiment with fine-tuning and adapting large language models (such as GPT and BERT) for new, innovative use cases.

We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day.

We are an equal opportunity / affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances.

If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to [email protected].

To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.

Required Skills & Experience

Required Technical Skills

Python proficiency, especially with web frameworks like FastAPI or Flask.

Strong experience with LangChain and associated libraries.

Proven expertise in building and optimizing RAG pipelines.

Proficiency in using vector databases (e.g., Pinecone, FAISS).

Experience with retrievers and re-rankers.

Solid understanding of AWS services (Lambda, EC2, RDS, etc.).

Knowledge of SQL and NoSQL databases.

Familiarity with dashboarding tools such as Grafana and Tableau.

Soft Skills

Problem-solving: Ability to handle complex and dynamic challenges with AI solutions.

Collaboration: Experience working in multidisciplinary teams (data scientists, DevOps, etc.).

Adaptability: Eagerness and passion to keep up with the latest AI advancements and incorporate them into solutions.

Communication: Excellent verbal and written communication skills to convey technical information to both technical and non-technical stakeholders.

This role is ideal for engineers who are passionate about pushing the boundaries of generative AI and have the technical skills to create cutting-edge, deployable solutions.

Benefit packages for this role will start on the 31st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching.

Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.

26 days ago