Benchmarking Deflection and Hallucination in Large Vision-Language Models
arXiv:2604.12033v1 Announce Type: cross
Abstract: Large Vision-Language Models (LVLMs) increasingly rely on retrieval to answer knowledge-intensive multimodal questions. Existing benchmarks overlook conflicts between visual and textual evidence and th…