The Phrase Gap: AI Won’t Pull the Trigger, But It’ll Hand You the Loaded Gun

I red-teamed an AI agent with real tool access. 87% of attacks succeeded. Then my own classifier turned out to be wrong — and the real…