cs.AI, cs.CV

GEASS: Training-Free Caption Steering for Hallucination Mitigation in Vision-Language Models

arXiv:2605.01733v1
Abstract: Vision-Language Models (VLMs) excel at grounded reasoning but remain prone to object hallucination. Recent work treats self-generated captions as a uniformly positive resource, yet we find that naively…