I Tested GPT-5.4 vs Claude Opus 4.6 on 20 Real Tasks — The #1 Model on LMSYS Isn’t What You Think

Two days ago, Claude Opus 4.6 quietly took the #1 spot on the LMSYS Chatbot Arena with an Elo score of 1504 — the highest any model has…

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top