Streaming Speech-to-Text Translation with a SpeechLLM
arXiv:2605.14766v1 Announce Type: cross
Abstract: Normally, a system that translates speech into text consists of separate modules for speech recognition and text-to-text translation. Combining those tasks into a SpeechLLM promises to exploit paraling…