cs.CV

BoxComm: Benchmarking Category-Aware Commentary Generation and Narration Rhythm in Boxing

arXiv:2604.04419v1 Announce Type: new
Abstract: Recent multimodal large language models (MLLMs) have shown strong capabilities in general video understanding, driving growing interest in automatic sports commentary generation. However, existing benchm…