Hybrid in Philadelphia, Pennsylvania
•
Today
Position Type: Contract Location: Philadelphia | Work Mode: Hybrid, minimum 3 days in the office Consultant Requirements On-Prem LLM & Vector DB Implementation Core Experience Hands-on experience deploying open-source LLMs such as Meta Llama 3 and Mistral / Mixtral in on-prem or private environmentsStrong proficiency in Python for LLM inference, prompt engineering, and integrationExperience with CPU-based inference, model quantization, and performance tuning Vector Databases & RAG Practical e
Easy Apply
Contract, Third Party
Depends on Experience

