GPO-V: Jailbreak Diffusion Vision Language Model by Global Probability Optimization
arXiv:2605.07399v2 Announce Type: replace
Abstract: Diffusion Vision-Language Models (dVLMs), built upon the non-causal foundations of Diffusion Large Language Models (dLLMs), have demonstrated remarkable efficacy in multimodal tasks by departing from…