Mitigating Multimodal Hallucination via Phase-wise Self-reward
arXiv:2604.17982v1 Announce Type: cross
Abstract: Large Vision-Language Models (LVLMs) still struggle with vision hallucination, where generated responses are inconsistent with the visual input. Existing methods either rely on large-scale annotated da…