cs.AI, cs.CR, cs.SE

Measuring the Permission Gate: A Stress-Test Evaluation of Claude Code’s Auto Mode

arXiv:2604.04978v1 Announce Type: cross
Abstract: Claude Code’s auto mode is the first deployed permission system for AI coding agents, using a two-stage transcript classifier to gate dangerous tool calls. Anthropic reports a 0.4% false positive rate …