cs.CV, cs.LG

A Patch-based Cross-view Regularized Framework for Backdoor Defense in Multimodal Large Language Models

arXiv:2604.04488v1 Announce Type: new
Abstract: Multimodal large language models have become an important infrastructure for unified processing of visual and linguistic tasks. However, such models are highly susceptible to backdoor implantation during…