natolambert/rlhf-book

Textbook on reinforcement learning from human feedback

View on GitHub
2.0kStars
212Forks
2Claude Commits
PythonLanguage
Website
aialignmentrlhf
First Claude commit: Mar 18, 2026Last Claude commit: 3mo agoDiscovered: Mar 19, 2026

Recent Claude Commits