cs.AI, cs.CV

Revealing Multi-View Hallucination in Large Vision-Language Models

arXiv:2603.23934v1 Announce Type: cross
Abstract: Large vision-language models (LVLMs) are increasingly being applied to multi-view image inputs captured from diverse viewpoints. However, despite this growing use, current LVLMs often confuse or mismat…