FRESH Hacker News
Home
Steering interpretable language models with concept algebra
36 points by luulinh90s