Sr. Data Scientist

Overview

Remote
On Site
Hybrid
Depends on Experience
Contract - W2
Contract - 12 Month(s)

Skills

AWS
NLP
python

Job Details

Position: Sr. Data Scientist

Location: San Antonio, TX (Onsite twice per month)
Duration: W2 Contract (No C2C)


Role Overview

We are looking for an experienced Data Scientist with strong expertise in Natural Language Processing (NLP), Large Language Models (LLMs), and deep learning to join our team. The role requires hands-on experience with AWS cloud services and a solid background in building and deploying advanced text analytics solutions. Experience with data engineering concepts and tools will be a strong plus.


Key Responsibilities

  • Design, develop, and deploy NLP and text analytics solutions using advanced algorithms and deep learning techniques.
  • Work with LLMs (e.g., GPT, BERT, or similar architectures) for diverse text-processing tasks.
  • Build, optimize, and manage data pipelines and workflows on AWS (S3, Lambda, SageMaker, Glue, EMR).
  • Perform data wrangling, cleaning, and scripting for large-scale unstructured datasets.
  • Research and apply the latest advancements in Aand deep learning.
  • Collaborate with cross-functional teams to translate business needs into actionable insights.
  • Monitor, evaluate, and enhance model performance using robust metrics and validation methods.

Required Skills

  • Strong programming skills in Python (Pandas, NumPy, scikit-learn, PyTorch, TensorFlow, Hugging Face Transformers).
  • Proven experience with AWS services for data science and machine learning.
  • Deep expertise in NLP techniques: text classification, sentiment analysis, entity recognition, topic modeling, summarization, etc.
  • Hands-on experience with LLMs and deep learning architectures for text data.
  • Familiarity with ETL, automation, and data scripting for large-scale processing.
  • Solid understanding of modern NLP algorithms and research trends.
  • Strong problem-solving, communication, and collaboration skills.
  • Experience with MLOps, cloud-based model deployment, and monitoring.

Preferred / Nice to Have

  • Background in data engineering: ETL pipeline development, data modeling, data warehousing.
  • Experience with Apache Spark, AWS Glue, or similar frameworks.
  • Familiarity with workflow orchestration tools (Airflow, Step Functions).
  • Knowledge of optimizing data storage and retrieval for large-scale analytics.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.