Back to Media

Deep Reinforcement Learning from Human Preferences

Christiano, Leike, Brown, Martic, Legg, Amodei
paper completed ai-ml

Notes

Foundational RLHF paper. Learning reward models from human comparisons.