Back to Media
Deep Reinforcement Learning from Human Preferences
Christiano, Leike, Brown, Martic, Legg, Amodei
Notes
Foundational RLHF paper. Learning reward models from human comparisons.
Foundational RLHF paper. Learning reward models from human comparisons.