Back to Media
Reinforcement Learning: An Introduction
Sutton & Barto
Notes
The RL bible. Bandits to policy gradients to planning.