cs.CV

VidHal: Benchmarking Temporal Hallucinations in Vision LLMs

arXiv:2411.16771v3
Abstract: Vision Large Language Models (VLLMs) are widely acknowledged to be prone to hallucinations. Existing research addressing this problem has primarily been confined to image inputs, with limited explora…