Data Scientist

  • Chicago, IL
  • Posted 17 hours ago | Updated moments ago

Overview

On Site
Accepts corp to corp applications
Contract - Long term

Skills

NLP
clinical
AI
Data Scientist

Job Details

Data Scientist

Experience: 7+ Years
Location: Chicago, IL
What is in it for you?

As a Data Scientist, Clinical NLP & AI, you will be part of an agile team building intelligent healthcare solutions: developing advanced NLP modules, integrating LLMs and agentic workflows, and using AWS big data technologies to improve clinical data processing and usability.

Responsibilities and mandatory skills:
  • Proficient developer in multiple languages; Python is a must. Able to learn new languages quickly.

  • Expertise in SQL (complex queries; relational databases, preferably PostgreSQL) and NoSQL databases (Redis and Elasticsearch).

  • Extensive big data experience, including EMR, Spark, and Kafka/Kinesis, and in optimizing data pipelines, architectures, and datasets.

  • AWS expertise with hands-on experience in Lambda, Glue, Athena, Kinesis, IAM, and EMR/PySpark, plus Docker.

  • Proficient in CI/CD development using Git and Terraform, and in agile methodologies.

  • Comfortable with stream-processing systems (Storm, Spark Streaming) and workflow management tools (Airflow).

  • Exposure to knowledge graph technologies (Graph DB, OWL, SPARQL) is a plus.

  • Experience with machine learning frameworks: TensorFlow, PyTorch, Scikit-learn, XGBoost.

  • Experience in model deployment: Flask, FastAPI, Docker, Kubernetes, TensorFlow Serving, TorchServe.

Good to have skills
  • Familiarity with generative AI applications in healthcare and related use cases.

  • Understanding of healthcare data standards and terminologies such as HL7, FHIR, and CCDA.

  • Experience in creating detailed documentation, user manuals, and technical specifications.

  • Background in automated testing and validation frameworks for NLP outputs.

  • Ability to collaborate effectively with cross-functional teams, including engineering and product.

  • Exposure to LangChain or similar frameworks for building intelligent agent workflows.

Educational Qualifications:

Engineering degree: BE/ME/BTech/MTech/BSc/MSc.
Technical certification in multiple technologies is desirable.
