GAMBIT: A Gamified Jailbreak Framework for Multimodal Large Language Models
arXiv:2601.03416v3 Announce Type: replace
Abstract: Multimodal Large Language Models (MLLMs) are now widely deployed, yet their safety alignment remains fragile under adversarial inputs. Previous work has shown that increasing inference steps can …