cs.AI

BehaviorGuard: Online Backdoor Defense for Deep Reinforcement Learning

arXiv:2605.05977v1 Announce Type: new
Abstract: Backdoor attacks pose a serious threat to deep reinforcement learning (DRL). Current defenses typically rely on reward anomalies to reverse-engineer triggers and model finetuning to remove backdoors. How…