I red-teamed an AI agent with real tool access. 87% of attacks succeeded. Then my own classifier turned out to be wrong — and the real…
I red-teamed an AI agent with real tool access. 87% of attacks succeeded. Then my own classifier turned out to be wrong — and the real…