AI Research Scientist LLM & RL

Overview

Remote

Depends on Experience

Accepts corp to corp applications

Contract - Independent

Contract - W2

Contract - 54 Month(s)

Able to Provide Sponsorship

Skills

Artificial Intelligence

Large Language Models (LLMs)

Neural Network

Data Science

Deep Learning

Machine Learning (ML)

Pandas

scikit-learn

PPO

Python

TensorFlow

Open Source

PyTorch

Job Details

AI Researcher with deep expertise in LLM fine-tuning, reinforcement learning, recommendation systems, and anomaly detection. You will be part of a cutting-edge team driving innovation in applied AI, delivering models and solutions that power real-world applications.

Design and implement fine-tuning pipelines for large language models (LLMs) using open-source and proprietary datasets.
Conduct research and develop models based on reinforcement learning (RL) frameworks such as PPO, DDPG, or A3C.
Build and optimize recommendation engines leveraging collaborative filtering, matrix factorization, and deep learning.
Develop and deploy anomaly detection models using statistical, unsupervised, and neural network approaches.

Qualifications:

Advanced degree (PhD or Master's) in Computer Science, Machine Learning, Data Science, or related field
Strong background in LLM fine-tuning (e.g., LoRA, PEFT, RLHF, prompt tuning)
Hands-on experience with reinforcement learning algorithms and toolkits (e.g., Ray RLlib, OpenAI Gym, Stable Baselines3)
Solid understanding of recommendation system architectures (collaborative filtering, deep learning, hybrid models)
Experience in anomaly detection using techniques such as Isolation Forest, Autoencoders, or One-Class SVM
Proficiency in Python and libraries like PyTorch, TensorFlow, Hugging Face Transformers, Scikit-learn, and Pandas

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Job Details

Share