FRESH Hacker News
Reinforcement Learning from Human Feedback
95 points by onurkanbkrc