On-Premises LLM & Vector Database Implementation Consultant

Hybrid in Philadelphia, PA, US • Posted 16 hours ago • Updated 16 hours ago
Contract W2
No Travel Required
Hybrid
$70 - $75/hr

Job Details

Skills

  • LLM
  • RAG
  • Vector DB

Summary

Job Description

We are seeking an experienced consultant to lead the design and deployment of a secure, on-premises Large Language Model (LLM) solution integrated with a vector database and Retrieval-Augmented Generation (RAG) capabilities. The ideal candidate brings deep hands-on expertise across the full stack, from model deployment and inference optimization to enterprise security and knowledge transfer.

Core Experience

The consultant must have demonstrated experience deploying open-source LLMs, such as Meta Llama 3 and Mistral/Mixtral, within on-premises or private infrastructure environments. Strong Python proficiency is essential, particularly for LLM inference pipelines, prompt engineering, and system integration. The role also requires expertise in CPU-based inference strategies, model quantization techniques, and performance tuning to ensure efficient operation in resource-constrained environments.

Vector Databases & RAG

Candidates must have practical, production-level experience with open-source vector databases such as Qdrant, Chroma, Milvus, or pgvector. A strong track record of designing and implementing end-to-end RAG pipelines is required, along with expertise in embedding generation, management, and metadata filtering to support accurate and efficient semantic retrieval.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: indony
  • Position Id: JPC - 202407