SR Agentic AI / LLM Engineer

Hybrid in Chicago, IL, US • Posted 2 days ago • Updated 2 days ago
Contract W2
No Travel Required
Hybrid
$65 - $70/hr
Fitment

Dice Job Match Score™

🫥 Flibbertigibetting...

Job Details

Skills

  • Lang Chain
  • LLM
  • RAG
  • FastAPI

Summary

SR Agentic AI / LLM Engineer

Hybrid - Chicago, IL (Open for relocation)

 

Looking for a SR Agentic AI / LLM Engineer to work onsite in Chicago. They need SR Level – someone who has built RAG solutions from scratch.

 

 Design, develop, and maintain scalable web services using FastAPI or Flask frameworks.
Write efficient, reusable, and modular Python code to support API-driven LLM applications.
Lang Chain & Supporting Frameworks:
Implement Lang Chain to build custom pipelines for document indexing, retrieval, and summarization.
Integrate Lang Chain’s RAG capabilities with other components like vector stores and retrievers to support real-time querying and document processing.
RAG Pipelines:
Architect and deploy Retrieval-Augmented Generation (RAG) systems for chatbots, knowledge systems, and other generative AI applications.
Optimize RAG systems for speed, accuracy, and scalability across multiple use cases.
Vector Stores & Retrievers:
Work with vector databases like Pinecone, Chroma, FAISS, or Milvus to store and manage embeddings.
Implement retrievers and re-rankers to improve query efficiency, ensuring high-quality and relevant outputs for users.
AWS Cloud Deployment:
Deploy and manage LLM-based applications on AWS, leveraging services such as Lambda, EC2, S3, EKS, and RDS.
Ensure the scalability, availability, and reliability of deployed applications.
Dashboards and Monitoring (Optional):
Create monitoring dashboards using tools like Grafana or Tableau for real-time system monitoring, analytics, and performance insights.
Experimentation with Generative AI:
Research and integrate the latest advancements in generative AI technologies.
Experiment with fine-tuning and adapting large language models (like GPT, BERT) for new, innovative use cases.
Required Technical Skills
Python proficiency, especially with web frameworks like FastAPI or Flask.
Strong experience with Lang Chain and associated libraries.
Proven expertise in building and optimizing RAG pipelines.
Proficiency in using vector databases (e.g., Pinecone, FAISS).
Experience with retrievers and re-rankers.
Solid understanding of AWS services (Lambda, EC2, RDS, etc.).
Knowledge of SQL and NoSQL databases.
Familiarity with dashboarding tools such as Grafana and Tableau.
Soft Skills
Problem-solving: Ability to handle complex and dynamic challenges with AI solutions.
Collaboration: Experience working in multidisciplinary teams (data scientists, DevOps, etc.).
Adaptability: Eagerness and passion to keep up with the latest AI advancements and incorporate them into solutions.
Communication: Excellent verbal and written communication skills to convey technical information to both technical and non-technical stakeholders.  

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 91134724
  • Position Id: 8948703
  • Posted 2 days ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Hybrid in Chicago, Illinois

2d ago

Easy Apply

Contract, Third Party

Depends on Experience

Hybrid in Chicago, Illinois

22d ago

Easy Apply

Contract

Depends on Experience

Hybrid in Chicago, Illinois

Today

Easy Apply

Contract

$80 - $90

Hybrid in Chicago, Illinois

Today

Easy Apply

Contract

$80+

Search all similar jobs