cs.AI, cs.CL, cs.CR

Stealthy Backdoor Attacks against LLMs Based on Natural Style Triggers

arXiv:2604.21700v1 Announce Type: cross
Abstract: The growing application of large language models (LLMs) in safety-critical domains has raised urgent concerns about their security. Many recent studies have demonstrated the feasibility of backdoor att…