cs.CV

Empowering Video Translation using Multimodal Large Language Models

arXiv:2604.11283v1 Announce Type: new
Abstract: Recent developments in video translation have further enhanced cross-lingual access to video content, with multimodal large language models (MLLMs) playing an increasingly important supporting role. With…