cs.AI, cs.CL, cs.CR, cs.LG

Rethinking Jailbreak Detection of Large Vision Language Models with Representational Contrastive Scoring

arXiv:2512.12069v3 Announce Type: replace-cross
Abstract: Large Vision-Language Models (LVLMs) are vulnerable to a growing array of multimodal jailbreak attacks, necessitating defenses that are both generalizable to novel threats and efficient for pra…