natolambert/rlhf-book
Textbook on reinforcement learning from human feedback
aialignmentrlhf
First Claude commit: Mar 18, 2026Last Claude commit: 3mo agoDiscovered: Mar 19, 2026
Recent Claude Commits
Rename Slides to Course page with video links (#304)
a3d430f3mo agoco_authored_byFix lecture 1 title slide and slide 21 line break (#305)
c760fb83mo agoco_authored_by