Hybrid in Philadelphia, Pennsylvania
•
4d ago
Consultant Requirements On-Prem LLM & Vector DB Implementation Core Experience Hands-on experience deploying open-source LLMs such as Meta Llama 3 and Mistral / Mixtral in on-prem or private environmentsStrong proficiency in Python for LLM inference, prompt engineering, and integrationExperience with CPU-based inference , model quantization, and performance tuning Vector Databases & RAG Practical experience with open-source vector databases such as Qdrant , Chroma , Milvus , or pgvec
Easy Apply
Contract
55 - 60



