cs.CL, cs.SD, eess.AS

Direct Simultaneous Translation Activation for Large Audio-Language Models

arXiv:2509.15692v2 Announce Type: replace-cross
Abstract: Simultaneous speech-to-text translation (Simul-S2TT) aims to translate speech into target-language text in real time, emitting translations while the source speech is still being received, rather than waiting for…