Overview
On Site
USD 82,000.00 - 95,000.00 per year
Full Time
Skills
Science
Named-Entity Recognition (NER)
Mapping
Semantics
Modeling
Artificial Intelligence
Training
Data Stewardship
Collaboration
Mentorship
Innovation
Analytical Skill
Statistics
Bioinformatics
Analytics
Natural Language Processing
Data Structure
Python
Data Integration
Organizational Skills
Data Science
Research
Amazon Web Services
Security+
SEP
Facebook
LinkedIn
Job Details
Data Scientist III
Job no: 537010
Work type: Staff Full-Time
Location: Main Campus (Gainesville, FL)
Categories: Physical/Mathematical Sciences
Department:29051308 - MD-MED QUANTITATIVE HEALTH
Classification Title:
Data Scientist III
Classification Minimum Requirements:
A Bachelor s Degree in data science, statistics, bioinformatics, analytics, or similar field and five years of experience; Master s Degree in data science, statistics, bioinformatics, analytics, or similar field and three years of experience; Doctoral Degree in data science, statistics, bioinformatics, analytics, or similar field and one year of experience.
Job Description:
A Bachelor s Degree in data science, statistics, bioinformatics, analytics, or similar field and five years of experience; Master s Degree in data science, statistics, bioinformatics, analytics, or similar field and three years of experience; Doctoral Degree in data science, statistics, bioinformatics, analytics, or similar field and one year of experience.
Essential Functions;
NLP Pipeline Development and Text Analysis
- Build and maintain pipelines using tools such as MedSpaCy, cTAKES, or similar to extract structured variables from clinical notes.
- Tune entity recognition, concept mapping, and negation detection to support patient-level feature generation.
- Document pipeline logic and validation metrics.
Structured EHR Feature Engineering
- Develop tools to extract, clean, and organize structured EHR variables (e.g., labs, medications, diagnoses).
- Apply clinical standards (e.g., OMOP, FHIR) to support semantic consistency and cross-site interoperability.
- Transform EHR data into research-ready formats aligned with modeling needs.
Temporal Alignment and Multimodal Integration
- Align clinical events with imaging and biopsy timelines to enable time-resolved analysis.
- Support the construction of longitudinal patient records for AI model training and validation.
- Troubleshoot data conflicts, gaps, and synchronization issues.
Collaboration and Data Stewardship
- Liaise with institutional data providers to ensure accurate, secure data transfers.
- Contribute to protocol development and maintain clear analytic traceability.
Communicate updates and collaborate across project teams and external sites. Mentorship and Innovation Support
- Provide informal guidance to student researchers or junior analysts.
- Recommend new tools or analytic methods to improve pipeline performance.
Expected Salary:
$82,000 - $95,000
Required Qualifications:
A Bachelor s Degree in data science, statistics, bioinformatics, analytics, or similar field and five years of experience; Master s Degree in data science, statistics, bioinformatics, analytics, or similar field and three years of experience; Doctoral Degree in data science, statistics, bioinformatics, analytics, or similar field and one year of experience.
Preferred:
Experience with NLP in the clinical domain using libraries like MedSpaCy or cTAKES
Knowledge of EHR data structures, standards, and interoperability frameworks (e.g., OMOP, FHIR)
Familiarity with Python and clinical data integration tools
Strong organizational skills and attention to reproducibility and versioning
Experience collaborating with clinical, data science, or research stakeholders
Additional technical certifications (e.g., AWS, Security+, etc.) may be encouraged but not required.
Special Instructions to Applicants:
In order to be considered, you must upload your cover letter and resume.
Application must be submitted by 11:55 p.m. (ET) of the posting end date.
Health Assessment Required:No
Advertised: 08 Sep 2025 Eastern Daylight Time
Applications close: 15 Sep 2025 Eastern Daylight Time
Whatsapp Facebook LinkedIn Email App
Job no: 537010
Work type: Staff Full-Time
Location: Main Campus (Gainesville, FL)
Categories: Physical/Mathematical Sciences
Department:29051308 - MD-MED QUANTITATIVE HEALTH
Classification Title:
Data Scientist III
Classification Minimum Requirements:
A Bachelor s Degree in data science, statistics, bioinformatics, analytics, or similar field and five years of experience; Master s Degree in data science, statistics, bioinformatics, analytics, or similar field and three years of experience; Doctoral Degree in data science, statistics, bioinformatics, analytics, or similar field and one year of experience.
Job Description:
A Bachelor s Degree in data science, statistics, bioinformatics, analytics, or similar field and five years of experience; Master s Degree in data science, statistics, bioinformatics, analytics, or similar field and three years of experience; Doctoral Degree in data science, statistics, bioinformatics, analytics, or similar field and one year of experience.
Essential Functions;
NLP Pipeline Development and Text Analysis
- Build and maintain pipelines using tools such as MedSpaCy, cTAKES, or similar to extract structured variables from clinical notes.
- Tune entity recognition, concept mapping, and negation detection to support patient-level feature generation.
- Document pipeline logic and validation metrics.
Structured EHR Feature Engineering
- Develop tools to extract, clean, and organize structured EHR variables (e.g., labs, medications, diagnoses).
- Apply clinical standards (e.g., OMOP, FHIR) to support semantic consistency and cross-site interoperability.
- Transform EHR data into research-ready formats aligned with modeling needs.
Temporal Alignment and Multimodal Integration
- Align clinical events with imaging and biopsy timelines to enable time-resolved analysis.
- Support the construction of longitudinal patient records for AI model training and validation.
- Troubleshoot data conflicts, gaps, and synchronization issues.
Collaboration and Data Stewardship
- Liaise with institutional data providers to ensure accurate, secure data transfers.
- Contribute to protocol development and maintain clear analytic traceability.
Communicate updates and collaborate across project teams and external sites. Mentorship and Innovation Support
- Provide informal guidance to student researchers or junior analysts.
- Recommend new tools or analytic methods to improve pipeline performance.
Expected Salary:
$82,000 - $95,000
Required Qualifications:
A Bachelor s Degree in data science, statistics, bioinformatics, analytics, or similar field and five years of experience; Master s Degree in data science, statistics, bioinformatics, analytics, or similar field and three years of experience; Doctoral Degree in data science, statistics, bioinformatics, analytics, or similar field and one year of experience.
Preferred:
Experience with NLP in the clinical domain using libraries like MedSpaCy or cTAKES
Knowledge of EHR data structures, standards, and interoperability frameworks (e.g., OMOP, FHIR)
Familiarity with Python and clinical data integration tools
Strong organizational skills and attention to reproducibility and versioning
Experience collaborating with clinical, data science, or research stakeholders
Additional technical certifications (e.g., AWS, Security+, etc.) may be encouraged but not required.
Special Instructions to Applicants:
In order to be considered, you must upload your cover letter and resume.
Application must be submitted by 11:55 p.m. (ET) of the posting end date.
Health Assessment Required:No
Advertised: 08 Sep 2025 Eastern Daylight Time
Applications close: 15 Sep 2025 Eastern Daylight Time
Whatsapp Facebook LinkedIn Email App
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.