TOC-Bench: A Temporal Object Consistency Benchmark for Video Large Language Models
arXiv:2605.09904v1 Announce Type: new
Abstract: Video large language models (Video-LLMs) have achieved remarkable progress in general video understanding, yet their ability to maintain temporal object consistency remains insufficiently explored. Exist…