I tested GLM-5.1 — it beat GPT-5.4 & Claude Opus 4.6 and is 7.8× cheaper.

An MIT-licensed model just hit #1 on SWE-Bench Pro, beating both GPT-5.4 and Claude Opus 4.6 at real-world software engineering. I spent…