Mdp

Browse posts by tag

April 24, 2026

Reinforcement Learning: An Introduction

Notes

The RL bible. Bandits to policy gradients to planning.

December 17, 2025

Reinforcement Learning: An Introduction

Notes

Mathematical RL fundamentals (MDPs, value functions, dynamic programming, approximate methods). RL foundational text that bridges theory and practice.

March 12, 2024

The AI Course: Everything is Utility Maximization

Intelligence as utility maximization under uncertainty. A unifying framework connecting A* search, reinforcement learning, Bayesian networks, and MDPs.