Vector Databases & RAG Consultant

Hybrid in Philadelphia, PA, US • Posted 14 hours ago • Updated 14 hours ago
Contract Corp To Corp
Contract Independent
Contract W2
No Travel Required
Able to Sponsor
Hybrid
Depends on Experience
Fitment

Dice Job Match Score™

👾 Reticulating splines...

Job Details

Skills

  • Python
  • Vector Databases
  • LLM
  • CPU
  • Performance Tuning
  • RAG
  • data privacy

Summary

 

Position Type: Contract
Location: Philadelphia | Work Mode: Hybrid, minimum 3 days in the office

 

Consultant Requirements – On-Prem LLM & Vector DB Implementation

 

Core Experience

  • Hands-on experience deploying open-source LLMs such as Meta Llama 3 and Mistral / Mixtral in on-prem or private environments
  • Strong proficiency in Python for LLM inference, prompt engineering, and integration
  • Experience with CPU-based inference, model quantization, and performance tuning

 

Vector Databases & RAG

  • Practical experience with open-source vector databases such as Qdrant, Chroma, Milvus, or pgvector
  • Proven implementation of Retrieval-Augmented Generation (RAG) pipelines
  • Experience generating and managing embeddings and metadata filtering

Security & Governance

  • Understanding of data privacy, air-gapped deployments, and enterprise security requirements
  • Experience implementing access controls and audit logging

Nice to Have

  • Experience with LangChain or LlamaIndex
  • Exposure to Rust, Go, or C++ for high-performance services
  • Familiarity with Docker and Kubernetes for on-prem deployments
  • Knowledge of inference frameworks (e.g., vLLM, llama.cpp, Hugging Face Transformers)
  •  
  • Prior work in regulated or enterprise environments

Deliverables

  • Reference architecture and deployment guidance
  • Working prototype (LLM + vector DB + RAG)
  • Documentation and knowledge transfer to internal teams
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: marlabnj
  • Position Id: 8936403
  • Posted 14 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Hybrid in Philadelphia, Pennsylvania

Today

Easy Apply

Contract

70 - 75

Hybrid in Philadelphia, Pennsylvania

Today

Easy Apply

Third Party, Contract

$60 - $70

Hybrid in Philadelphia, Pennsylvania

Today

Easy Apply

Contract

Up to $58

Hybrid in Philadelphia, Pennsylvania

Today

Easy Apply

Contract

60 - 65

Search all similar jobs