Why AI Chatbots Agree with You Even When You're Wrong
8 points by Brajeshwar
by Kave0ne
The sycophancy problem is partly architectural. Most chatbots have no persistent model of what the user said last time, so they default to agreeing with the current message in isolation. When you can retrieve "user previously believed X, then corrected to Y" from a structured memory graph, the model has actual evidence to work with rather than just the current context window. Without that, agreeableness is a rational default. The fix is less about RLHF and more about memory architecture.
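A minimal sketch of the idea: a per-topic log of claims stands in for the structured memory graph, and retrieval surfaces the correction chain as context the model can condition on. All names here (`MemoryGraph`, `assert_claim`, `retrieval_context`) are hypothetical, not from any real system.

```python
class MemoryGraph:
    """Hypothetical per-user belief store: topic -> claims, oldest first."""

    def __init__(self):
        self.beliefs: dict[str, list[str]] = {}

    def assert_claim(self, topic: str, claim: str) -> None:
        # Append rather than overwrite, so corrections stay visible.
        self.beliefs.setdefault(topic, []).append(claim)

    def retrieval_context(self, topic: str) -> str:
        # Render the correction chain as evidence for the prompt,
        # instead of letting the model see only the latest message.
        chain = self.beliefs.get(topic, [])
        if not chain:
            return "No prior beliefs recorded."
        if len(chain) == 1:
            return f"User previously stated: {chain[0]!r}."
        return (f"User previously believed {chain[0]!r}, "
                f"then corrected to {chain[-1]!r}.")


graph = MemoryGraph()
graph.assert_claim("capital_of_australia", "Sydney")
graph.assert_claim("capital_of_australia", "Canberra")
print(graph.retrieval_context("capital_of_australia"))
```

With that context prepended to the prompt, disagreeing with a repeated wrong claim is grounded in retrieved evidence rather than left to the model's agreeable default.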