Alberto G. Rodr\'iguez Salgado

History Anchors: How Prior Behavior Steers LLM Decisions Toward Unsafe Actions

Alberto G. Rodr\'iguez Salgado / May 14, 2026

arXiv:2605.13825v1 Announce Type: new
Abstract: Frontier LLMs are increasingly deployed as agents that pick the next action after a long log of prior tool calls produced by the same or a different model. We ask a simple safety question: if a prior ste…

Author name: Alberto G. Rodr\'iguez Salgado

History Anchors: How Prior Behavior Steers LLM Decisions Toward Unsafe Actions