CBV: Clean-label Backdoor Attacks on Vision Language Models via Diffusion Models
arXiv:2605.02202v1 Announce Type: new
Abstract: Vision-Language Models (VLMs) have achieved remarkable success in tasks such as image captioning and visual question answering (VQA). However, as their applications become increasingly widespread, recent…