Hearing to Translate: The Effectiveness of Speech Modality Integration into LLMs
arXiv:2512.16378v4
Abstract: As Large Language Models (LLMs) expand beyond text, integrating speech as a native modality has given rise to SpeechLLMs, which directly process spoken language and enable speech-to-text translation …