Mitigating Action-Relation Hallucinations in LVLMs via Relation-aware Visual Enhancement
arXiv:2605.11808v1 Announce Type: new
Abstract: Large Vision-Language Models (LVLMs) have achieved remarkable performance on diverse vision-language tasks. However, LVLMs still suffer from hallucinations, generating text that contradicts the visual in…