DO-Bench: An Attributable Benchmark for Diagnosing Object Hallucination in Vision-Language Models
arXiv:2604.22822v1 Announce Type: cross
Abstract: Object level hallucination remains a central reliability challenge for vision language models (VLMs), particularly in binary object existence verification. Existing benchmarks emphasize aggregate accur…