Reinforcement Learning from Human Feedback (RLHF) in Notebooks github.com 48 points by ash_at_hny 5 hours ago
Hl