Senior/Lead Data Scientist (Clinical Document Automation)

Overview

Remote
Depends on Experience
Contract - W2
Contract - 6 Month(s)

Skills

Artificial Intelligence
Cross-functional Team
Data Extraction
Data Modeling
Decision-making
Electronic Health Record (EHR)
Graph Databases
Health Care
LangChain
Natural Language Processing
Neo4j
Python
Regulatory Compliance
SQL

Job Details

Hi,

I'm hiring for a Clinical NLP/LLM Data Scientist focused on the following:

  • Extracting entities and insights from clinical text (EHR/EMR, unstructured notes, UMLS/FHIR vocabularies).

  • Evaluating and validating LLM outputs for accuracy, compliance, and safety, ensuring models meet healthcare standards.

  • Designing and maintaining knowledge graphs in Neo4j to connect and model complex clinical data.

  • Collaborating with engineering and product teams to translate requirements into well-documented, technically precise solutions.

Tech stack: Python, SQL, Transformers/Hugging Face, spaCy/MedSpaCy, LangChain, MLflow.
Must-have experience: Data extraction from structured/unstructured sources, strong NLP background, data modeling and graph database expertise (Neo4j preferred).

This is a chance to directly improve clinical decision-making by building reliable, production-ready AI solutions while working with a highly collaborative cross-functional team.
