Less Detail, Better Answers: Degradation-Driven Prompting for VQA
arXiv:2604.04838v2 Announce Type: replace
Abstract: Recent advancements in Vision-Language Models (VLMs) have significantly pushed the boundaries of Visual Question Answering (VQA). However, high-resolution details can sometimes become noise that leads …