VEBench:Benchmarking Large Multimodal Models for Real-World Video Editing
arXiv:2605.03276v1 Announce Type: new
Abstract: Real-world video editing demands not only expert knowledge of cinematic techniques but also multimodal reasoning to select, align, and combine footage into coherent narratives. While recent Large Multimo…