FRESH Hacker News
DeepSWE: A contamination-free benchmark for long-horizon coding agents
59 points by ammar_x