cs.LG, stat.ML

Why Does Agentic Safety Fail to Generalize Across Tasks?

arXiv:2605.06992v1 Announce Type: new
Abstract: AI agents are increasingly deployed in multi-task settings, where the task to perform is specified at test time, and the agent must generalize to unseen tasks. A major concern in such settings is safety:…