Data Scientist Clinical NLP & AI - Locals Only

Overview

On Site
Depends on Experience
Contract - W2
Contract - Independent
Contract - 12 Month(s)

Skills

Data Scientist
NLP
LLM
AWS
Pyspark

Job Details

As a Data Scientist Clinical NLP & AI, you will be part of an agile team focused on building intelligent healthcare solutions by developing advanced NLP modules, integrating LLMs and agentic workflows, and leveraging AWS big data technologies to enhance clinical data processing and usability.

Responsibilities: -

  • Analyze and process clinical textual data using AI-powered NLP techniques and advanced machine learning models.
  • Modify and improve current workflows by incorporating cutting-edge machine learning and deep learning algorithms, including leveraging large language models (LLMs) and tools like LangGraph for complex AI agentic workflows in healthcare contexts.
  • Develop NLP modules within the NLP development team using programming or scripting languages such as Python.
  • Conduct pre-processing and quality analysis for textual data inputs and validate performance of NLP outputs.
  • Create systematic testing procedures, error-checking mechanisms, and user manuals for NLP modules.
  • Build infrastructure for optimal extraction, transformation, and loading of data from diverse sources including MCP servers, using SQL and AWS big data frameworks such as EMR and Spark/pySpark.
  • Collaborate with Engineering teams to ensure scalable and efficient data workflows using SQL and AWS big data technologies.
  • Apply working knowledge of AWS services, particularly AWS Bedrock, to develop generative AI applications.
  • Utilize relational databases such as PostgreSQL or MySQL for data storage and retrieval in NLP and AI workflows..

Experience: -

  • 7+ Years

Location: -

  • Chicago, IL

Educational Qualifications: -

  • Engineering Degree BE/ME/BTech/MTech/BSc/MSc.
  • Technical certification in multiple technologies is desirable.

Skills: -

Mandatory skills

  • Proficiency in Python and scripting languages for NLP and machine learning development.
  • Strong understanding of clinical NLP techniques and experience with machine learning and deep learning models.
  • Hands-on experience with large language models and agentic workflow tools such as LangGraph.
  • Expertise in SQL and big data technologies including AWS EMR and Spark/pySpark.
  • Practical knowledge of AWS services, especially AWS Bedrock for generative AI applications.
  • Experience with relational databases such as PostgreSQL or MySQL.

Good to have skills: -

  • Familiarity with generative AI applications in healthcare and related use cases.
  • Understanding of healthcare data standards and terminologies such as HL7, FHIR, and CCDA.
  • Experience in creating detailed documentation, user manuals, and technical specifications.
  • Background in automated testing and validation frameworks for NLP outputs.
  • Ability to collaborate effectively with cross-functional teams including engineering and products.
  • Exposure to LangChain or similar frameworks for building intelligent agent workflows.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About People Force Consulting Inc