Anthropic locked Claude Code to native apps in Jan 2026. Are we still comparing models or just ecosystems
Quick context: both vendors publish SWE-bench scores on their own scaffolds. The same model swings 22 points depending on harness design — more than the gap between any two frontier models. Since January 2026, Claude plans are restricted to Anthr…