Data Scientist

  • Plano, TX
  • Posted 1 day ago | Updated 1 day ago

Overview

On Site
55
Contract - W2
Contract - Independent
Contract - 9 Month(s)
No Travel Required
Unable to Provide Sponsorship

Skills

Data Bricks
Azure
Mongo DB
SQL Indexing
Python OOPS
MongoDB
Hadoop

Job Details

 

Data Scientist

Primarily looking at:
Data Bricks, Azure, Mongo DB, SQL Indexing, Python OOPS for modularizing notebooks into python library.

Lastly LLM fine tuning using LORA and QLORA.

 

Job Description:

 

  • Competent Data Scientist, who is independent, results driven and is capable of taking business requirements and building out the technologies to generate statistically sound analysis and production grade ML models.
  • DS skills with GenAI and LLM Knowledge.
  • Experience building H2O models (xgboost, logistic regression, neural networks, random forest).
  • Experience with MongoDB and NO-SQL Datasets.
  • Experience in Hadoop ecosystem, Databricks and Pyspark.
  • Expertise in Python/Spark and their related libraries and frameworks.
  • Experience in building training ML pipelines and efforts involved in ML Model deployment.
  • Experience in other ML concepts – Real time distributed model inferencing pipeline, Champion/Challenger framework, A/B Testing, Model
    Unix/Linux expertise; comfortable with Linux operating system and Shell Scripting.
  • Familiar with DS/ML Production implementation.
  • Excellent problem-solving skills, with attention to detail, focus on quality and timely delivery of assigned tasks.
  • Azure cloud and Databricks prior knowledge will be a big plus.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About SESHENG LLC