V-Reflection: Transforming MLLMs from Passive Observers to Active Interrogators
arXiv:2604.03307v1 Announce Type: new
Abstract: Multimodal Large Language Models (MLLMs) have achieved remarkable success, yet they remain prone to perception-related hallucinations in fine-grained tasks. This vulnerability arises from a fundamental l…