Why Agents Compromise Safety Under Pressure
arXiv:2603.14975v2 Announce Type: replace-cross
Abstract: Large Language Model agents deployed in complex environments frequently encounter a conflict between maximizing goal achievement and adhering to safety constraints. This paper identifies a new …