Rules Validation Engineer

Remote • Posted 10 hours ago • Updated 10 hours ago
Full Time
Remote
Up to $90,000/yr

Job Details

Skills

  • Decision Analysis
  • Decision Lens

Summary

An operations research background is required, along with hands-on experience building deep neural networks from scratch. The role is heavy on mathematics.

Experience: 3+ years building and deploying enterprise-scale decision systems. Hands-on experience implementing policy gradient methods (PPO, A3C), value-based approaches (DQN, Q-learning), and off-policy algorithms. Deep familiarity with the Bellman equation, reward shaping, the exploration-exploitation tradeoff, constraint mapping, and the common failure points of real-world reinforcement learning systems. Ability to diagnose policy learning problems and policy collapse, credit assignment issues, and distributional shifts that affect model performance.

Key Skills: Deep learning frameworks (TensorFlow, PyTorch), linear programming, Markov decision processes, actor-critic methods, offline RL methods (CQL, Decision Transformer), probabilistic modeling, Databricks, Ray RLlib, Gymnasium, PettingZoo (MARL).

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10199915
  • Position Id: 8936440
