Variational Visual Question Answering for Uncertainty-Aware Selective Prediction
arXiv:2505.09591v3
Abstract: Despite remarkable progress in recent years, Vision Language Models (VLMs) remain prone to overconfidence and hallucinations on tasks such as Visual Question Answering (VQA) and Visual Reasoning. Bay…