Overview
On Site
Full Time
Skills
Apache Hive
XGBoost
Logistic Regression
Training
Machine Learning (ML)
Real-time
A/B Testing
Apache Spark
Scheduling
Unix
Linux
Operating Systems
Shell Scripting
Job Scheduling
Apache Hadoop
Conflict Resolution
Problem Solving
Attention To Detail
FOCUS
DS
Generative Artificial Intelligence (AI)
Python
PySpark
MongoDB
Microsoft Azure
Cloud Computing
Databricks
Job Details
Azure-Data Scientist
Competent Data Scientist who is independent, results-driven, and capable of taking business requirements and building out the technologies to generate statistically sound analyses and production-grade ML models.
Note: Job Description and Background Check
Candidates may be subject to a Background Check/Drug Test, as required by the end client, before the assignment starts.
- Independent problem solver/go-getter
- Data Scientist with expert-level experience in the Hadoop ecosystem and analysis tools, including Hive/Spark
- Strong understanding of the Azure framework. Azure associate certification preferred.
- Expert in Databricks
- Experience building H2O models (xgboost, logistic regression, neural networks, random forest)
- Experience with MongoDB
- Expertise in R and Jupyter notebooks
- Expertise in Python/Spark and their related libraries and frameworks
- Experience building ML training pipelines and with the effort involved in ML model deployment
- Experience in other ML concepts - Real-time distributed model inferencing pipeline, Champion/Challenger framework, A/B Testing, Model
- Expert in using the larger Hadoop ecosystem
- Experienced with creating and submitting Spark jobs (scheduling)
- Unix/Linux expertise; comfortable with the Linux operating system and Shell Scripting
- Familiar with job scheduling challenges in Hadoop
- Excellent problem-solving skills, with attention to detail, focus on quality and timely delivery of assigned tasks.
- Databricks SME
- Core DS skills with GenAI; hands-on in Python, PySpark, and MongoDB.
- Prior knowledge of Azure cloud and Databricks will be a big plus.