Data Scientist (AI/LLM Focused)

Overview

Remote
$50 - $55
Accepts corp to corp applications
Contract - Independent
Contract - W2
Contract - 12 Month(s)

Skills

Generative AI
Large Language Models (LLMs)
Natural Language Processing (NLP)
Machine Learning (ML)
Deep Learning (DL)
Multimodal Analysis
Data Scientist

Job Details

100% REMOTE

Our Fortune 50 healthcare insurance client is seeking a highly skilled Data Scientist to play a pivotal role in the development, expansion, operation, and maintenance of generative AI solutions. The primary responsibilities include running an experimental framework to determine the optimal prompt engineering approaches, tuning prompts, and collaborating with subject matter experts (SMEs) on evaluations and results. This role requires a deep understanding of how to evaluate model output in production, particularly when ground truth is absent, how to monitor for issues such as model drift and hallucinations, and how to optimize for offline and online metrics.

Key Responsibilities:

  • Scope, develop, expand, operate, and maintain scalable, reliable and safe generative AI solutions.
  • Design and execute prompt engineering experiments to optimize Large Language Models (LLMs) for various use cases.
  • Collaborate with SMEs to evaluate prompt effectiveness and align AI solutions with business needs.
  • Understand and apply offline and online evaluation metrics for LLMs, ensuring continuous model improvements.
  • Evaluate production models using live data in the absence of ground truth, implementing robust monitoring systems.
  • Monitor LLM applications for model drift, hallucinations, and performance degradation.
  • Ensure smooth integration of LLMs into existing workflows, providing real-time insights and predictive analytics.

Qualifications:

  • Proven experience in data science, with expertise in managing structured and unstructured data.
  • Proficiency in statistical techniques, predictive analytics, and reporting results.
  • Experience in applied science in fields like Natural Language Processing (NLP), Machine Learning (ML), Deep Learning (DL), or Multimodal Analysis.
  • Strong background in software development, data modeling, or data engineering.
  • Deep understanding of building and scaling ML models, specifically LLMs.
  • Familiarity with open-source tools such as PyTorch, and with statistical analysis and data visualization tools.
  • Experience with vector databases and graph databases is a plus.

Preferred Skills:

  • Experience in prompt engineering and prompt optimization.
  • Expertise in running experiments to evaluate generative AI performance.
  • Knowledge of production-level monitoring tools for ML models, including drift detection and mitigation strategies.
  • Excellent problem-solving skills and ability to work cross-functionally with data scientists, engineers, and SMEs.
  • Experience with safety, security and responsible use of AI.
  • Experience with red-teaming (adversarial testing) of generative AI.
  • Experience developing AI applications that handle sensitive data such as PHI, PII, and other highly confidential data.
