Decision-Theory

Good and Real

Notes

Demystifying paradoxes from physics to ethics. LessWrong favorite.

The Policy: Q-Learning vs Policy Learning

SIGMA uses Q-learning rather than direct policy learning. This architectural choice makes it both transparent and terrifying. You can read its value function, but what you read is chilling.
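
To make the "you can read its value function" point concrete, here is a minimal tabular Q-learning sketch. It is not SIGMA's actual architecture: the four-state corridor, the hyperparameters, and every function name below are hypothetical, chosen only to show that a Q-learner's preferences sit in a table you can print, whereas a directly learned policy would only hand you the chosen action.

```python
import random

# Hypothetical toy environment: a 4-state corridor with a reward at the right end.
N_STATES = 4
ACTIONS = (0, 1)  # 0 = left, 1 = right
GOAL = N_STATES - 1


def step(state, action):
    """Move one cell left or right; reaching GOAL pays 1.0 and ends the episode."""
    nxt = max(0, state - 1) if action == 0 else min(GOAL, state + 1)
    reward = 1.0 if nxt == GOAL else 0.0
    return nxt, reward, nxt == GOAL


def train_q_table(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Standard tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                # Break ties at random so the untrained agent still explores.
                best = max(q[state])
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])
            nxt, reward, done = step(state, action)
            # Q-learning target: reward plus discounted best next-state value.
            target = reward if done else reward + gamma * max(q[nxt])
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
    return q


def greedy_policy(q):
    """The policy is never stored; it is derived by taking the argmax of Q."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(GOAL)]


q_table = train_q_table()
for s in range(GOAL):
    print(f"state {s}: Q(left)={q_table[s][0]:.3f}  Q(right)={q_table[s][1]:.3f}")
print("greedy policy:", greedy_policy(q_table))
```

The printed table is the transparency the blurb refers to: every state-action pair carries an explicit number for how much the agent expects to gain, and the behavior is just the argmax over those numbers.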

AI Fiction