GenAI engineer

Philadelphia, PA, US • Posted 2 days ago • Updated 2 days ago
Contract Corp To Corp
Contract W2
12 Months
No Travel Required
On-site
Depends on Experience
Fitment

Dice Job Match Score™

📊 Calculating match score...

Job Details

Skills

  • Python
  • RAG
  • Vector DB
  • LLMs
  • Mistral
  • Mixtral

Summary

Consultant Requirements – On-Prem LLM & Vector DB Implementation

 

Core Experience

  • Hands-on experience deploying open-source LLMs such as Meta Llama 3 and Mistral / Mixtral in on-prem or private environments
  • Strong proficiency in Python for LLM inference, prompt engineering, and integration
  • Experience with CPU-based inference, model quantization, and performance tuning

 

Vector Databases & RAG

  • Practical experience with open-source vector databases such as QdrantChromaMilvus, or pgvector
  • Proven implementation of Retrieval-Augmented Generation (RAG) pipelines
  • Experience generating and managing embeddings and metadata filtering

Security & Governance

  • Understanding of data privacy, air-gapped deployments, and enterprise security requirements
  • Experience implementing access controls and audit logging

Nice to Have

  • Experience with LangChain or LlamaIndex
  • Exposure to Rust, Go, or C++ for high-performance services
  • Familiarity with Docker and Kubernetes for on-prem deployments
  • Knowledge of inference frameworks (e.g., vLLMllama.cppHugging Face Transformers)
  •  
  • Prior work in regulated or enterprise environments

Deliverables

  • Reference architecture and deployment guidance
  • Working prototype (LLM + vector DB + RAG)
  • Documentation and knowledge transfer to internal teams
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10217276
  • Position Id: 522512-15352-
  • Posted 2 days ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Philadelphia, Pennsylvania

Today

Third Party, Contract

DOE

Hybrid in Philadelphia, Pennsylvania

3d ago

Easy Apply

Contract

Depends on Experience

Philadelphia, Pennsylvania

Today

Easy Apply

Contract

USD 70.00 - 75.00 per hour

Philadelphia, Pennsylvania

Today

Contract

Compensation information provided in the description

Search all similar jobs