Back to Media
Constitutional AI: Harmlessness from AI Feedback
Bai, Kadavath, Kundu, Askell, Kernion, Jones, Chen, et al.
Notes
Self-critique and revision using principles instead of human labels.