cs.LG

On the Hardness of Jailbreaking LLMs

arXiv:2605.05116v1 Announce Type: new
Abstract: Large language models (LLMs) are known to be vulnerable to jailbreak attacks, which typically rely on carefully designed prompts containing explicit semantic structure. These attacks generally operate by…