FRESH Hacker News
Summary of METR's predeployment evaluation of GPT-5.6 Sol
6 points by pongogogo