Remote • Today
Requires an operations research background and hands-on experience building deep neural networks from scratch; the role is heavy on the math side. Experience: 3+ years building and deploying enterprise-scale decision systems. Hands-on experience implementing policy gradient methods (PPO, A3C), value-based approaches (DQN, Q-learning), and off-policy algorithms. Deep familiarity with the Bellman equation, reward shaping, the exploration-exploitation tradeoff, constraint mapping, and knowing common failur
Full-time
Up to $90,000











