Rethinking Jailbreak Detection of Large Vision Language Models with Representational Contrastive Scoring
arXiv:2512.12069v3
Abstract: Large Vision-Language Models (LVLMs) are vulnerable to a growing array of multimodal jailbreak attacks, necessitating defenses that are both generalizable to novel threats and efficient for pra…