RedDiffuser: Auditing Multimodal Safety Failures in Vision-Language Models via Reinforced Diffusion
arXiv:2503.06223v5 Announce Type: replace
Abstract: Large Vision-Language Models (VLMs) are increasingly deployed in open-ended environments, where ensuring reliable safety under multimodal inputs is critical. However, existing evaluations remain larg…