R-CoV: Region-Aware Chain-of-Verification for Alleviating Object Hallucinations in LVLMs
arXiv:2604.20696v1 Announce Type: new
Abstract: Large vision-language models (LVLMs) have demonstrated impressive performance in various multimodal understanding and reasoning tasks. However, they still struggle with object hallucinations, i.e., the c…