Machine learning scientist. Closing pull requests like it's my job.
Textbook on reinforcement learning from human feedback