Overview
Skills
Job Details
Key Required Skills:
Strong knowledge of AI/ML/LLM, Python, NLP, Generative AI and experience in the clinical domain.
Position Description:
Hands on experience in Python, NLP, ML and Generative AI
Understand real world challenges and develop automated data solutions
Develop, test, and deploy new techniques for NLP understanding
Scalable development/deployment of ML and Generative AI approaches (such as Large Language
Models (LLMs)
Train and optimize NLP/LLM models and create Python based pipelines
Determine the nature of analytic problems, evaluate options, and offer recommendations for
resolution.
Advise on the methods and data needed and/or available to evaluate the (intelligence or data)
problem.
Collaborate with data collectors and analysts to identify and close gaps on complex monitoring
problems.
Provide accurate, timely, complex, and sophisticated data analysis.
Skills Requirements:
Basic Skills:
o Bachelors degree in Statistics, Applied Mathematics, Computer Science, or Information Science
with industry experience on Python, NLP, data science, AI/ML/LLM engineering.
o Minimum 8 Year (s) of Data Scientist experience
o Must be able to obtain and maintain a Public Trust. Contract requirement.
Required Skills:
o Experience with Natural Language Processing (NLP), Generative AI and Large Language Models
(LLM)
o Fluency in Python Programming, version control and collaboration with GIT, standard Python
packages (ex. Pandas, numpy, matplotlib) and ML frameworks
o Knowledge of TensorFlow, PyTorch, Pandas, scikit-learn, NLTK, Azure ML (optional), Amazon Web
Services EC2.
o Experience with scalable data engineering frameworks such as Apache Spark and orchestration
frameworks such as Airflow, and/or experience with semantic search.
o Expert knowledge in conducting data analysis and applying advanced statistical concepts and ML
methods to build, train, test, and evaluate a variety of supervised and unsupervised analytic
models.
o Experience with ML model deployment and operations like DevOps, MLOps, LLMOps.
o Experience with NLP and Generative AI libraries like regular expressions (e.g., spacy, langchain),
text annotation tools and semantic frameworks.
o Ability to clean and process large amounts of real-world data.
o Experience retrieving and manipulating data from a variety of data sources included DB2, Oracle,
SQL Server, Hadoop and flat files.
o Experience with database management systems (e.g., PostgresSQL, MySQL, SQLite, SQL, etc.)
o Excellent analytical skills to identify potential risks and propose effective solutions.
o Excellent problem-solving skills, ability to collaborate with cross-functional teams and proven
communication in written and verbal formats to various audiences to include executive leadership.
Desired Skills:
o Prior experience working on applications in the clinical domain.
o Prior experience with federal or state governments IT projects.
o Industry experience preferred.
o Experience building AI chatbot.
o Experience with, or the ability and willingness to learn distributed processing via the Hadoop
ecosystem, i.e., Spark, Impala and Hive.
o Experience working in an analytical research environment.
o Experience in parallel processing such as GPU programming with CUDA
o Experience with Mathematica
o Experience using markup languages such as LaTeX, HTML, etc.
o Experience with Natural Language Processing for anomaly detection
Education:
o Bachelors degree with 12+ years of experience
o Must be able to obtain and maintain a Public Trust. Contract requirement.