Measuring the Permission Gate: A Stress-Test Evaluation of Claude Code’s Auto Mode
arXiv:2604.04978v1 Announce Type: cross
Abstract: Claude Code’s auto mode is the first deployed permission system for AI coding agents, using a two-stage transcript classifier to gate dangerous tool calls. Anthropic reports a 0.4% false positive rate …