TOC-Bench: A Temporal Object Consistency Benchmark for Video Large Language Models
arXiv:2605.09904v2 Announce Type: replace
Abstract: Video large language models (Video-LLMs) have made strong progress in general video understanding, but their ability to maintain temporal object consistency remains underexplored. Existing benchmarks…