Sr. Data Scientist/ Lead Data Scientist

Overview

Hybrid

Depends on Experience

Accepts corp to corp applications

Contract - W2

Contract - Independent

Contract - 6 Month(s)

Skills

LangGraph

AutoGen

CrewAI

Python

Hugging Face Transformers

Vector DBs

GCP

A/B Test

Job Details

Contract role - Sr. Data Scientist to Lead level

Required experience

Hands-on experience with LLMs, including prompt engineering, fine-tuning, or agentic

architectures (LangGraph, AutoGen, CrewAI, etc.).

Strong programming skills in Python, with experience in building scalable ML/AI applications.

Model evaluation expertise for LLM-based systems, including designing metrics and running A/B

tests or offline experiments.

Python

LangChain / LangGraph / CrewAI / AutoGen (any relevant)

Hugging Face Transformers

Vector DBs (e.g., FAISS, Weaviate, Pinecone)

Git, VS Code, and cloud platforms (Google Cloud Platform preferred)

Contribute to the development of a Knowledge Assistant for the Pro & Services organization, with a

focus on natural language understanding and agentic workflows.

Build, evaluate, and iterate on LLM-powered agents to support task execution, reasoning, and

retrieval across structured and unstructured data.

Collaborate closely with product managers, engineers, and other data scientists to integrate

intelligent systems into customer and associate-facing platforms.

Own model evaluation and validation pipelines, especially for LLM and RAG workflows, including

performance tracking and ablation studies.

Write clean, production-grade Python code and contribute to reusable components and pipelines.

Apply critical thinking and analytical problem-solving to identify patterns, define rules, and

optimize agent behaviors.

Preferred experience

Experience with retrieval-augmented generation (RAG) and knowledge graphs, vector Databases.

Retail or digital experience

Designing custom evaluation pipelines for hallucination detection, factual consistency, and user

relevance.

Familiarity with frameworks like TruLens, Ragas, Promptfoo, or ReAct-style evaluation loops.

Implementing guardrails to ensure safety, compliance, or brand alignment in LLM outputs.

Prior exposure to PhD-level research or applied LLM work in industry or academia.

Disqualifiers

Generic ML experience without any LLM, agentic, or applied AI work.

Heavy reliance on low-code/no-code ML platforms without demonstrated software engineering capability.

Lack of hands-on involvement in end-to-end model deployment or evaluation.

Team environment

The candidate will join a tight-knit, cross-functional AI team focused on building LLM-powered

capabilities for Pro and Services use cases. The team includes Data Scientists, AI Engineers and a

AI Product Manager

The environment is collaborative and fast-paced, with strong product orientation, a culture of rapid

experimentation, and close alignment with business stakeholders. Candidates should be comfortable with

ambiguity, quick pivots, and high-impact deliverables.

Any Additional Details:

Ideal for someone passionate about pushing the boundaries of LLM applications in real-world settings.

There's significant opportunity to shape foundational tools that will power experiences for Pro

customers and store associates.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Job Details

Share