cs.AI, cs.CV

Variational Visual Question Answering for Uncertainty-Aware Selective Prediction

arXiv:2505.09591v3 Announce Type: replace
Abstract: Despite remarkable progress in recent years, Vision Language Models (VLMs) remain prone to overconfidence and hallucinations on tasks such as Visual Question Answering (VQA) and Visual Reasoning. Bay…