Data Scientist

Houston, TX, US • Posted 8 hours ago • Updated 8 hours ago
Contract W2
On-site
Depends on Experience
Fitment

Dice Job Match Score™

🛠️ Calibrating flux capacitors...

Job Details

Skills

  • NLP
  • Python
  • Machine Learning
  • Large Language Models (LLMs)
  • LangGraph

Summary

Position: Data Scientist

Location: Houston, TX 3days/week

Experience: 8+ Years

Build, deploy, and operationalize scalable AI-powered clinical NLP and machine learning solutions using deep learning, LLMs, and cloud-native big data platforms in healthcare environments.

Responsibilities: -

  • Analyze and process large volumes of unstructured clinical and healthcare text using advanced NLP, machine learning, and deep learning techniques
  • Enhance and optimize existing AI/NLP workflows by designing and implementing state-of-the-art algorithms, including Large Language Models (LLMs) and agentic frameworks such as LangGraph, to improve performance, scalability, and usability
  • Develop, maintain, and extend modular NLP components using Python and other relevant programming or scripting languages
  • Perform comprehensive text pre-processing, data quality assessments, feature engineering, and validation of NLP model outputs
  • Design, implement, and execute systematic testing frameworks, error-handling mechanisms, and model performance evaluation methodologies
  • Build, deploy, and manage end-to-end ML pipelines following MLOps best practices, including versioning, monitoring, retraining, and CI/CD automation
  • Create and maintain technical documentation, model documentation, testing reports, and user manuals
  • Design and develop scalable data pipelines for extraction, transformation, and loading (ETL) from diverse data sources, including MCP servers
  • Leverage SQL and AWS big data technologies such as EMR, Spark, and PySpark for large-scale data processing
  • Collaborate closely with Engineering, Data, and Platform teams to design, deploy, and optimize robust and secure AI/NLP infrastructure
  • Utilize AWS services for model development and deployment, including AWS Bedrock for generative AI applications
  • Work with relational databases to manage structured and semi-structured data efficiently.

Educational Qualifications: -

  • Engineering Degree BE/ME/BTech/MTech/BSc/MSc.
  • Technical certification in multiple technologies is desirable.

Skills: Mandatory skills

  • Strong hands-on expertise in Natural Language Processing (NLP), machine learning, and deep learning
  • Proficiency in Python for building and deploying NLP and ML solutions
  • Experience working with Large Language Models (LLMs), prompt engineering, and agentic workflows (e.g., LangGraph or similar frameworks)
  • Solid understanding of data pre-processing, normalization, feature extraction, and quality validation techniques
  • Strong MLOps experience, including model versioning, pipeline orchestration, CI/CD, monitoring, performance tracking, and retraining strategies
  • Experince to containerization and orchestration tools (Docker, Kubernetes
  • Proficiency in SQL for data querying, transformation, and analytic
  • Practical experience with AWS big data and compute services (EMR, Spark, PySpark)
  • Working knowledge of AWS services, including AWS Bedrock for generative AI use cases
  • Experience with at least one relational database: PostgreSQL or MySQL
  • Strong understanding of testing strategies, error analysis, and model validation techniques

Good-to-Have Skills

  • Prior experience with clinical, biomedical, or healthcare NLP use cases
  • Familiarity with healthcare data standards, terminologies, or ontologies
  • Experience deploying ML/NLP solutions in regulated or production healthcare environments
  • Knowledge of distributed systems and cloud-native data architectures
  • Experience with additional data stores, data warehouses, or NoSQL technologies
  • Strong technical documentation and stakeholder communication skills
  • Experience working in agile or cross-functional product development teams

Thanks & Regards,

Bhupender Singh

XL Impex Inc dba

Atika Technologies

5 Independence Way, Suite 300,

Princeton, NJ 08540

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10506616
  • Position Id: 8935167
  • Posted 8 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Houston, Texas

12d ago

Easy Apply

Third Party, Contract

Depends on Experience

Remote or Houston, Texas

Today

Contract, Third Party

$80 - $90 hourly

Hybrid in Houston, Texas

8d ago

Easy Apply

Contract

Depends on Experience

Houston, Texas

Today

Easy Apply

Contract

USD 58.00 - 62.00 per hour

Search all similar jobs