VC-Inspector: Advancing Reference-free Evaluation of Video Captions with Factual Analysis
arXiv:2509.16538v3 Announce Type: replace-cross
Abstract: We propose VC-Inspector, a lightweight, open-source large multimodal model (LMM) for reference-free evaluation of video captions, with a focus on factual accuracy. Unlike existing metrics that …