Yassin H. Rassul, Tarik A. Rashid

AgentShield: Deception-based Compromise Detection for Tool-using LLM Agents

Yassin H. Rassul, Tarik A. Rashid / May 13, 2026

arXiv:2605.11026v1 Announce Type: cross
Abstract: Defenses against indirect prompt injection (IPI) in tool-using LLM agents share two structural weaknesses. First, they all attempt to prevent attacks rather than detect the compromises that slip throug…
