Hacker News

RLHF from Scratch

61 points by onurkanbkrc

by fauria

RLHF: Reinforcement learning from human feedback - https://en.wikipedia.org/wiki/Reinforcement_learning_from_hu...

by alansaber

Looks good. I am a big advocate for hands-on demos like this as the best way for beginners to learn ML.