Data Scientist with Gen AI

Overview

On Site
Depends on Experience
Accepts corp to corp applications
Contract - Independent
Contract - 12 Month(s)

Skills

Apache Hadoop
Apache Spark
BERT
Deep Learning
Microsoft Azure
Jupyter
Keras
Named-Entity Recognition (NER)
Natural Language Processing
Python
PyTorch
NLTK

Job Details

Position: Data Scientist with Gen AI

Location: Plano, TX (Day 1 Onsite)

Duration: Long Term

Role Summary:

  • Programming Languages: Advanced proficiency in Python (preferred) and/or R; experience with Jupyter notebooks.
  • NLP Libraries & Frameworks: Strong hands-on experience with NLTK, spaCy, Gensim, Hugging Face Transformers, and Scikit-learn.
  • Text Preprocessing: Expertise in processing noisy, unstructured text from various data sources
  • Domain-Specific NLP: Familiarity with entity recognition, intent detection, and text classification
  • Machine Learning: Solid foundation in supervised and unsupervised learning, with applications to telecom problems (e.g., anomaly detection, predictive maintenance, customer segmentation).
  • Deep Learning for NLP: Experience with deep learning frameworks (TensorFlow, PyTorch, Keras) for advanced NLP tasks (LSTM, Transformers, BERT, GPT).
  • Data Handling: Proficient in handling large-scale, high-velocity telecom datasets; experience with distributed data processing (Spark, Hadoop) is a plus.
  • Evaluation: Design and interpret experiments to evaluate NLP models, including error analysis and business impact assessment.
  • Statistical Analysis: Strong understanding of statistics and probability as applied to telecom service quality and customer experience.
  • Azure experience preferred
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.