Reinforcement Learning: An Introduction

Sutton & Barto

book completed ai-ml

Year 2018

External Link http://incompleteideas.net/book/RLbook2020.pdf

MDP temporal difference policy gradient from:books

Notes

The RL bible. Bandits to policy gradients to planning.

View Resource All Media More in Ai-Ml