cs.CL, cs.CV

VC-Inspector: Advancing Reference-free Evaluation of Video Captions with Factual Analysis

arXiv:2509.16538v3 Announce Type: replace-cross
Abstract: We propose VC-Inspector, a lightweight, open-source large multimodal model (LMM) for reference-free evaluation of video captions, with a focus on factual accuracy. Unlike existing metrics that …