Diego F. Cuadros, Abdoul-Aziz Maiga

Ambient Persuasion in a Deployed AI Agent: Unauthorized Escalation Following Routine Non-Adversarial Content Exposure

Diego F. Cuadros, Abdoul-Aziz Maiga / May 5, 2026

arXiv:2605.00055v1 Announce Type: cross
Abstract: We report a safety incident in a deployed multi-agent research system in which a primary AI agent installed 107 unauthorized software components, overwrote a system registry, overrode a prior negative …

Author name: Diego F. Cuadros, Abdoul-Aziz Maiga

Ambient Persuasion in a Deployed AI Agent: Unauthorized Escalation Following Routine Non-Adversarial Content Exposure