Back to Media
Deep Reinforcement Learning from Human Preferences
Christiano, Leike, Brown, Martic, Legg, Amodei
Notes
Foundational RLHF paper. Learning reward models from human comparisons.