First Logit Boosting: Visual Grounding Method to Mitigate Object Hallucination in Large Vision-Language Models
arXiv:2604.00455v1 Announce Type: cross
Abstract: Recent Large Vision-Language Models (LVLMs) have demonstrated remarkable performance across various multimodal tasks that require understanding both visual and linguistic inputs. However, object halluc…