Job Description
We are seeking an experienced consultant to lead the design and deployment of a secure, on-premises Large Language Model (LLM) solution with integrated vector-database and Retrieval-Augmented Generation (RAG) capabilities. The ideal candidate brings deep hands-on expertise across the full stack, from model deployment and inference optimization to enterprise security and knowledge transfer.
Core Experience
The consultant must have demonstrated experience deploying open-source LLMs such as Meta Llama 3 and Mistral/Mixtral within on-premises or private infrastructure. Strong Python proficiency is essential, particularly for LLM inference pipelines, prompt engineering, and system integration. The role also requires expertise in CPU-based inference strategies, model quantization, and performance tuning to ensure efficient operation in resource-constrained environments.
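To illustrate the quantization expertise this role calls for, here is a minimal, purely illustrative sketch of symmetric 8-bit weight quantization in plain Python. All function names are hypothetical; a real CPU deployment would instead rely on established quantized formats and runtimes (for example, GGUF models served with llama.cpp), which apply far more sophisticated schemes.

```python
def quantize_int8(weights):
    """Symmetric int8 quantization: map floats into [-127, 127] with a single scale.
    Illustrative sketch only -- production runtimes use per-block scales and
    mixed-precision schemes."""
    scale = max(abs(w) for w in weights) / 127 or 1.0  # avoid zero scale
    q = [round(w / scale) for w in weights]
    return q, scale

def dequantize_int8(q, scale):
    """Recover approximate float weights from the int8 codes."""
    return [v * scale for v in q]

weights = [0.02, -1.27, 0.64, 0.0]
q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)
# each restored value approximates the original within one quantization step
```

The memory saving is the point: each weight drops from 4 bytes (float32) to 1 byte plus a shared scale, at the cost of bounded rounding error.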
Vector Databases & RAG
Candidates must have practical, production-level experience with open-source vector databases such as Qdrant, Chroma, Milvus, or pgvector. A strong track record of designing and implementing end-to-end RAG pipelines is required, along with expertise in embedding generation and management and in metadata filtering to support accurate, efficient semantic retrieval.
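The retrieval step of such a RAG pipeline can be sketched as a toy in-memory search: rank stored chunks by cosine similarity to the query embedding, after pre-filtering on metadata. This is a hypothetical pure-Python sketch for illustration; a production system would use one of the vector databases named above and a learned embedding model rather than hand-written vectors.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def retrieve(query_vec, docs, top_k=2, metadata_filter=None):
    """Return the top_k chunks most similar to query_vec,
    optionally restricted to chunks whose metadata matches the filter."""
    candidates = [d for d in docs
                  if metadata_filter is None
                  or all(d["meta"].get(k) == v for k, v in metadata_filter.items())]
    return sorted(candidates,
                  key=lambda d: cosine(query_vec, d["vec"]),
                  reverse=True)[:top_k]

docs = [
    {"id": 1, "vec": [0.9, 0.1], "meta": {"dept": "hr"}},
    {"id": 2, "vec": [0.1, 0.9], "meta": {"dept": "eng"}},
    {"id": 3, "vec": [0.8, 0.2], "meta": {"dept": "eng"}},
]
hits = retrieve([1.0, 0.0], docs, top_k=1, metadata_filter={"dept": "eng"})
# only "eng" chunks are scored; chunk 3 is closest to the query vector
```

In a real pipeline the retrieved chunks would then be inserted into the LLM prompt as grounding context; the metadata pre-filter is what keeps retrieval both accurate (scoped to relevant documents) and efficient (fewer vectors to score).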