San Francisco, California • Today
8-12+ years in AI/ML engineering with a strong focus on Reinforcement Learning
Hands-on experience building and deploying RL systems in production (not just research)
Deep understanding of RL algorithms (PPO, A3C, SAC, DDPG) and policy optimization techniques
Experience with multi-agent and/or hierarchical reinforcement learning
Strong experience integrating RL with LLMs (fine-tuning, RL-based optimization, reasoning)
Proficiency in Python and ML frameworks (PyTorch, JAX, TensorFlow, or Ray RLlib)
Experi
Contract
Depends on Experience




