An MIT-licensed model just hit #1 on SWE-Bench Pro, beating both GPT-5.4 and Claude Opus 4.6 at real-world software engineering. I spent…
An MIT-licensed model just hit #1 on SWE-Bench Pro, beating both GPT-5.4 and Claude Opus 4.6 at real-world software engineering. I spent…