FRESH
Hacker News
Home
RLHF from Scratch
61 points by onurkanbkrc
by fauria
0 subcomment
RLHF: Reinforcement learning from human feedback -
https://en.wikipedia.org/wiki/Reinforcement_learning_from_hu...
by alansaber
0 subcomment
Looks good. I am a big advocate for these hands on demos as being the best way for beginners to learn ML