Data Scientist/Machine Learning Scientist

Data Scientist, Research Scientist, Machine Learning Engineer, AI, Artificial Intelligence, Python and/or R, xgboost/LightGBM, Random Forests, SVMs, PCA, Masters or Ph.D. in Computer Science, Electrical Engineering, Machine Learning, Statistics
Contract W2, Contract Independent, 24 Months
Depends on Experience

Job Description

Title: # RFP - 21506 - Akshay - Data Scientist/Machine Learning Scientist
Location: Atlanta, GA (Remote Till Covid-19)

Job Description:

Data Scientist / Machine Learning Scientist

(Machine Learning)

Who we are and what we do:

  • The AI and Data Science team is centralized across the entire organization.
  • We work with various product teams across various business units to define high-impact business problems, solve them using novel techniques, and execute and monitor them throughout their lifecycle.
  • Most of our models make it to production, they never sit in a research lab. But we also do quite a bit of research to stay up-to-date with the latest technologies/algorithms.
  • We are very collaborative, you will likely get lots of ideas from the team.


What kind of problems do we solve:


  • Our locomotives stream 350+ sensor information in real-time. We create predictive models to predict various component failures hours, days, and sometimes months in advance.
  • There are high-frame cameras beside our tracks, capturing images of trains and rail cars as they pass. We design various Deep Learning and Computer Vision algorithms to detect certain objects of interest or issues and defects. We then optimize their performance and deploy them at the edge for real-time scoring and notification of our mechanical personnel upon detections.
  • Want to learn more? Apply today!

What tools do we use:

  • We use Python, R, and Spark (PySpark, SparkR, Scala) for modeling and EDA.
    • You will have a local machine with 512GB of memory, so feel free to load the data in memory if it makes sense or if it fits (!)
  • You will also have terabytes of memory in our Spark cluster that is not shared by anyone.
  • We use Jupyter notebook, Emacs, PyCharm, Rstudio as IDEs.
  • We use Tensorflow, Keras, PyTorch, and MXNet for Deep Learning, and OpenCV for traditional Computer Vision.
    • You will have your own dedicated GPU (!) in addition to a GPU cluster to run parallel training and inference jobs.
  • We always have the latest versions of our tools/packages/libraries available.


What are our requirements:

  • Bachelor’s, Master’s or Ph.D. in Computer Science, Electrical Engineering, Machine Learning, Statistics or related field
  • Minimum of 1 year of relevant industry experience (as a Data Scientist, Research Scientist, Machine Learning Engineer, etc.), 2+ preferred; or proven qualifications.
  • Hands-on and theoretical knowledge of various Machine Learning algorithms and tools, e.g. xgboost/LightGBM, Random Forests, SVMs, PCA, t-sne, kmeans, DBSCAN, etc.
  • Expertise with Time Series problems is a plus
  • Excellent knowledge of Python and/or R, knowledge of Spark is a plus

What will be your duties:


  • Effectively utilize appropriate statistical and Machine Learning models and techniques to solve various business problems
  • Collaborate with various departments to identify opportunities for process improvement and developing analytics use-cases.
  • Evaluate accuracy and quality of data sources, as well as the designed models
  • Stays up to date with the latest models and changes in the technology
  • Communicate results to colleagues and business partners.


Best Regards,

Sethu Mathan, MBA
Manager, IT Strategic Solution

M: +1    |     F: +1-


7000 Peachtree Dunwoody Rd | Bldg. 11 , Suite 301

Atlanta , GA – 30328

Dice Id : 90773857
Position Id : 7255788
Originally Posted : 3 weeks ago
Have a Job? Post it