Clinical Data Scientist (NLP)?

Overview

On Site
BASED ON EXPERIENCE
Full Time
Contract - W2
Contract - Independent

Skills

MEDSPACY
CTAKES
CLINICAL NLP
BIOMEDICAL NLP
ENTITY RECOGNITION
CONCEPT MAPPING
NEGATION DETECTION
ONTOLOGY MAPPING
EHR
EMR
EPIC
CERNER
OMOP
FHIR
HL7
LOINC
SNOMED
ICD
RXNORM
PYTHON
ETL
ETL PIPELINES
FEATURE ENGINEERING
DATA INTEGRATION
TEMPORAL ALIGNMENT
DATA HARMONIZATION
ACADEMIC MEDICAL CENTER
CLINICAL INFORMATICS
TRANSLATIONAL RESEARCH
BIOMEDICAL AI
CLINICAL AI
HEALTH INFORMATICS
BIOMEDICAL INFORMATICS
CLINICAL DATA SCIENCE
MEDICAL INFORMATICS
CLINICAL ANALYTICS
HEALTHCARE ANALYTICS
HEALTH DATA SCIENCE
CLINICAL DECISION SUPPORT
PUBLIC HEALTH INFORMATICS
CLINICAL DATA ENGINEERING
ACADEMIC RESEARCH
CLINICAL RESEARCH INFORMATICS
MEDICAL SCHOOL
RESEARCH HOSPITAL
HEALTHCARE AI
MEDICAL AI
CLINICAL ML
BIOMEDICAL MACHINE LEARNING
CLINICAL MACHINE LEARNING
EXPERIENCE

Job Details

Position Title: Clinical Data Scientist (NLP)
About the Job
Duration: 12-month contract with possibility of extension
Work Location : Gainesville, FL 32611
Job : 25-03148

Internal Position Title -
Data Architect/ Data Modeler/ Database Analyst/ Data Warehouse Analyst/ Product Developer

Scope of work
NLP Pipeline Development and Text Analysis - 35%
Build and maintain pipelines using tools such as MedSpaCy, cTAKES, or similar to extract structured variables from clinical notes.
Tune entity recognition, concept mapping, and negation detection to support patient-level feature generation.
Document pipeline logic and validation metrics.

Structured EHR Feature Engineering - 25%
Develop tools to extract, clean, and organize structured EHR variables (e.g., labs, medications, diagnoses).
Apply clinical standards (e.g., OMOP, FHIR) to support semantic consistency and cross-site interoperability.
Transform EHR data into research-ready formats aligned with modeling needs.

Temporal Alignment and Multimodal Integration - 20%
Align clinical events with imaging and biopsy timelines to enable time-resolved analysis.
Support the construction of longitudinal patient records for AI model training and validation.
Troubleshoot data conflicts, gaps, and synchronization issues.

Collaboration and Data Stewardship - 15%
Liaise with institutional data providers to ensure accurate, secure data transfers.
Contribute to protocol development and maintain clear analytic traceability.
Communicate updates and collaborate across project teams and external sites.

Mentorship and Innovation Support - 5%
Provide informal guidance to student researchers or junior analysts.
Recommend new tools or analytic methods to improve pipeline performance.

Level: Intermediate

Skills
Master's degree in bioinformatics, biomedical engineering, computer science, data science, or a related field and three years of relevant experience; or a doctoral degree and one year of experience.
Experience with NLP in the clinical domain using libraries like MedSpaCy or cTAKES
Knowledge of EHR data structures, standards, and interoperability frameworks (e.g., OMOP, FHIR)
Familiarity with Python and clinical data integration tools
Strong organizational skills and attention to reproducibility and versioningExperience collaborating with clinical, data science, or research stakeholders

About our Company
DataSoft Technologies is a highly recognized provider of professional IT Consulting services in the US. Founded in 1994, DataSoft Technologies, Inc. provides staff augmentation services for Information Technology and Automotive Services. Our team member benefits include:
Paid Holidays/Paid Time Off (PTO)
Medical/Dental Insurance
Vision Insurance
Short Term/Long Term Disability
Life Insurance
401 (K)

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About Datasoft Technologies, Inc.