Data Scientist

Remote • Posted 6 days ago • Updated 6 days ago
Full Time
Remote
Depends on Experience
Fitment

Dice Job Match Score™

🛠️ Calibrating flux capacitors...

Job Details

Skills

  • Data Scientist
  • "Ph.D." OR "Ph.D" OR "Phd" OR "Doctor of Philosophy"
  • LLM
  • RAG
  • NLP
  • PyTorch

Summary

Data Scientist

Location: REMOTE ok although onsite in Raleigh is preferred.

6 months contract to hire.

Master's or PhD Preferred

Looking for people who have PHD.

  • 6 12+ years in Data Science / ML Engineering, with deep experience in LLM based systems.
  • Proven experience building multi-agent architectures (planner executor, tool use agents, ReAct style reasoning).
  • Strong background in RAG, embeddings, retrieval optimization, and evaluation.
  • Expertise in NLP, transformers, deep learning, and model fine tuning.
  • Proficiency with PyTorch, HuggingFace, LangChain/LlamaIndex, RAG, Kubernetes, and vector databases.
  • Experience designing production grade ML systems with monitoring, evaluation, and observability.

Nice to haves:

  • FANG Experience (Facebook, Amazon, Netflix, Google, or even Microsoft)
  • Lead Experience

Secondary Skills - Nice to Haves

  • Python
  • Machine learning
  • cloud computing

Job Description

client is partnered with a software company in Raleigh that needs to hire a Senior Data Scientist for their flagship product, LexisNexis Legal & Professional, a leading global provider of information and analytics. Recently LexisNexis has focused on the general availability of Lexis+ AI for U.S. customers, a generative AI solution designed to transform legal work. Lexis+ AI delivers trusted results in a familiar, easy-to-use interface with linked hallucination-free legal citations that combine the power of generative AI with proprietary LexisNexis search technology, Shepard s Citations functionality, and authoritative content.

This role leads the design and development of an advanced multi agent AI platform that powers intelligent research, drafting, and reasoning capabilities for large scale enterprise knowledge environments. You will architect agent frameworks, optimize retrieval augmented generation pipelines, fine tune language models, and build the infrastructure that enables AI systems to collaborate, plan, and execute complex tasks reliably. The work directly shapes the next generation of AI driven professional tools used by experts in high stakes domains.

Core Responsibilities

  • Architect and implement multi agent systems capable of planning, tool use, and coordinated task execution.
  • Design and optimize RAG pipelines including embeddings, hybrid retrieval, reranking, and context window strategies.
  • Fine tune and evaluate small, medium, and large language models for domain specific reasoning and summarization.
  • Develop prompt engineering frameworks, guardrails, and automated evaluation suites for agent reliability.
  • Build scalable ML services and APIs for production deployment in distributed environments.
  • Collaborate with product, engineering, and domain experts to translate complex workflows into agentic AI solutions.
  • Establish best practices for model evaluation, observability, safety, and compliance.
  • Mentor DS/ML engineers and contribute to long term AI strategy and architecture.

Required Expertise

  • 6 12+ years in Data Science / ML Engineering, with deep experience in LLM based systems.
  • Proven experience building agentic architectures (planner executor, tool use agents, ReAct style reasoning).
  • Strong background in RAG, embeddings, retrieval optimization, and evaluation.
  • Expertise in NLP, transformers, deep learning, and model fine tuning.
  • Proficiency with PyTorch, HuggingFace, LangChain/LlamaIndex, Ray, Kubernetes, and vector databases.
  • Experience designing production grade ML systems with monitoring, evaluation, and observability.
  • Strong communication skills and ability to lead technical direction.

Preferred Qualifications

  • Experience in enterprise search, knowledge management, or high compliance domains.
  • Experience with model distillation, LoRA/QLoRA, PEFT, and model compression.
  • Experience building evaluation frameworks for hallucination, grounding, and agent reliability.
  • Familiarity with knowledge graphs, symbolic reasoning, or hybrid neuro symbolic systems.
  • Publications, patents, or open source contributions in LLMs or agent systems.
  • Strong coding skills in Python 7+ years
  • Be a natural problem solver, able to take a lead in collaborating to resolve issues
  • Proficiency in IDE debugging : VSCODE and PYCHARM

Have communication skills

  • 5+ years of experience in AI and machine learning
  • Deep understanding of machine learning algorithms, classification models, diagnostic testing of models
  • Experience working directly and Transformer based architectures including BERT, RoBERTa, T5 etc. Nd familiarity with large language models and fine tuning
  • Experience with conversational search / semantic search, reinforcement learning, prompt engineering, hallucination mitigation
  • Working understanding of the business risks associated with applying LLM (LangChain) in a business
  • Experience working with AWS, RAG, SageMaker, SQL
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 90973529
  • Position Id: 8995675
  • Posted 6 days ago
Contact the job poster
Vijay Maragoni

Vijay Maragoni

Recruiter! @ Tech Rakers
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Remote or North Carolina

Today

Easy Apply

Contract

Remote

Yesterday

Easy Apply

Full-time

Up to $55

Remote or New York, New York

Yesterday

Full-time

USD 50.00 - 60.00 per hour

Remote

Yesterday

Full-time

USD 205,000.00 - 270,000.00 per year

Search all similar jobs