Summary of METR's predeployment evaluation of GPT-5.6 Sol

6 points | by pongogogo 10 hours ago

5 comments