Data Scientist (LLM)

Overview

On Site

$70 - $80

Contract - W2

Contract - 12 Month(s)

Skills

Algorithms

Artificial Intelligence

Computer Networking

Ensemble

Fusion

GPU

InfiniBand

Large Language Models (LLMs)

Layout

Machine Learning (ML)

Open Source

Optical Character Recognition

Optimization

PPO

Proprietary Software

Remote Direct Memory Access

Semantic Search

Training

Vector Databases

Job Details

Job Description

We are seeking a highly skilled Data Scientist specializing in Large Language Models (LLMs) to lead end-to-end model training, fine-tuning, and optimization across both open-source (Qwen, LLaMA, Mistral) and closed-source (OpenAI, Gemini, Anthropic) ecosystems. The ideal candidate will design advanced retrieval systems, implement distributed training environments, and apply reinforcement learning methods to align AI behavior with human and domain-specific needs.

Key Responsibilities

Lead end-to-end training and fine-tuning of Large Language Models (LLMs).
Architect and implement Graph RAG pipelines for contextual knowledge grounding.
Build and scale distributed training environments using NCCL and InfiniBand for multi-GPU and multi-node training.
Apply reinforcement learning techniques (RLHF, RLAIF) to align models with business and user needs.

Required Qualifications

10+ years of applied AI/ML experience with a track record of delivering production-grade models.
Deep expertise in:
- LLM training & fine-tuning (GPT, LLaMA, Mistral, Qwen)
- Graph-based retrieval systems (GraphRAG, knowledge graphs)
- Embedding models (BGE, E5, SimCSE)
- Semantic search & vector databases (FAISS, Weaviate, Milvus)
- Document segmentation & preprocessing (OCR, layout parsing)
- Distributed training frameworks (NCCL, Horovod, DeepSpeed)
- High-performance networking (InfiniBand, RDMA)
- Model fusion & ensemble methods (stacking, boosting, gating)
- Optimization algorithms (Bayesian, Particle Swarm, Genetic)
- Symbolic AI & rule-based systems
- Meta-learning & Mixture of Experts architectures
- Reinforcement learning (RLHF, PPO, DPO)

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Job Details

Job Description

Key Responsibilities

Required Qualifications

Share