Overview
Remote
Depends on Experience
Accepts corp to corp applications
Contract - Independent
Contract - W2
Contract - 54 Month(s)
Able to Provide Sponsorship
Skills
Artificial Intelligence
Large Language Models (LLMs)
Neural Network
Data Science
Deep Learning
Machine Learning (ML)
Pandas
scikit-learn
PPO
Python
TensorFlow
Open Source
PyTorch
Job Details
AI Researcher with deep expertise in LLM fine-tuning, reinforcement learning, recommendation systems, and anomaly detection. You will be part of a cutting-edge team driving innovation in applied AI, delivering models and solutions that power real-world applications.
- Design and implement fine-tuning pipelines for large language models (LLMs) using open-source and proprietary datasets.
- Conduct research and develop models based on reinforcement learning (RL) frameworks such as PPO, DDPG, or A3C.
- Build and optimize recommendation engines leveraging collaborative filtering, matrix factorization, and deep learning.
- Develop and deploy anomaly detection models using statistical, unsupervised, and neural network approaches.
Qualifications:
- Advanced degree (PhD or Master's) in Computer Science, Machine Learning, Data Science, or related field
- Strong background in LLM fine-tuning (e.g., LoRA, PEFT, RLHF, prompt tuning)
- Hands-on experience with reinforcement learning algorithms and toolkits (e.g., Ray RLlib, OpenAI Gym, Stable Baselines3)
- Solid understanding of recommendation system architectures (collaborative filtering, deep learning, hybrid models)
- Experience in anomaly detection using techniques such as Isolation Forest, Autoencoders, or One-Class SVM
- Proficiency in Python and libraries like PyTorch, TensorFlow, Hugging Face Transformers, Scikit-learn, and Pandas
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.