Software Developer/Engineer (Mid-Level experience)

Hybrid in Philadelphia, PA, US • Posted 1 day ago • Updated 1 day ago
Contract Independent
Contract W2
No Travel Required
Hybrid
Depends on Experience
Fitment

Dice Job Match Score™

🔢 Crunching numbers...

Job Details

Skills

  • Software Developer
  • Software Engineer
  • Enterprise AI
  • Open-Source
  • LLMs
  • Python
  • Meta
  • Llama 3
  • Mistral
  • Mixtral
  • On-Prem
  • RAG
  • Vector
  • Qdrant
  • Chroma
  • Milvus
  • Pgvector
  • Retrieval-Augmented Generation
  • Security
  • Governance
  • C++
  • Rust
  • vLLM
  • llama.cpp
  • Hugging
  • Face Transformers

Summary

Title: Software Developer/Engineer (Mid-Level experience).
Location:  Philadelphia, PA.
Duration: 12+ Months Contract (Hybrid, Minimum 3 days in the office).

Job Description:

The Software Developer/Engineer role will be building and deploying Enterprise AI solutions using Open-Source LLMs, Python, and RAG pipelines with Vector Databases. The position emphasizes hands-on development, performance optimization, and secure, on-prem or Air-Gapped deployments, with key deliverables including a working Prototype, Architecture Guidance, and knowledge transfer to internal teams.

Required Experience:

  • On-Prem LLM & Vector DB Implementation
  • Hands-on experience deploying open-source LLMs such as Meta Llama 3 and Mistral / Mixtral in On-Prem or Private Environments
  • Strong proficiency in Python for LLM inference, prompt engineering, and integration
  • Experience with CPU-based inference, model quantization, and performance tuning
  • Vector Databases & RAG
  • Practical experience with open-source vector databases such as Qdrant, Chroma, Milvus, or Pgvector
  • Proven implementation of Retrieval-Augmented Generation (RAG) pipelines
  • Experience generating and managing embeddings and metadata filtering
  • Security & Governance
  • Understanding of data privacy, air-gapped deployments, and enterprise security requirements
  • Experience implementing access controls and audit logging

 Preferred Experience:

  • Experience with LangChain or LlamaIndex
  • Exposure to Rust, Go, or C++ for high-performance services
  • Familiarity with Docker and Kubernetes for on-prem deployments
  • Knowledge of inference frameworks (e.g., vLLM, llama.cpp, Hugging Face Transformers)
  • Prior work in regulated or enterprise environments

 Deliverables:

  • Reference architecture and deployment guidance
  • Working prototype (LLM + vector DB + RAG)
  • Documentation and knowledge transfer to internal teams

 

Thank you for your time and I look forward to your reply.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 91130883
  • Position Id: 8937235
  • Posted 1 day ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Hybrid in Philadelphia, Pennsylvania

Today

Easy Apply

Contract

$50 - $60

Hybrid in Philadelphia, Pennsylvania

Today

Easy Apply

Contract

55 - 60

Hybrid in Philadelphia, Pennsylvania

2d ago

Easy Apply

Contract, Third Party

Depends on Experience

Hybrid in Philadelphia, Pennsylvania

Yesterday

Easy Apply

Contract

Up to $58

Search all similar jobs