cs.AI, cs.CL, eess.AS

Streaming Speech-to-Text Translation with a SpeechLLM

arXiv:2605.14766v1 Announce Type: cross
Abstract: Normally, a system that translates speech into text consists of separate modules for speech recognition and text-to-text translation. Combining those tasks into a SpeechLLM promises to exploit paraling…