cs.AI, cs.CL, cs.CR, cs.LG

Policy-Invisible Violations in LLM-Based Agents

arXiv:2604.12177v1 Announce Type: new
Abstract: LLM-based agents can execute actions that are syntactically valid, user-sanctioned, and semantically appropriate, yet still violate organizational policy because the facts needed for correct policy judgm…