cs.CL

Speech LLMs are Contextual Reasoning Transcribers

arXiv:2604.00610v1 Announce Type: new
Abstract: Despite extensions to speech inputs, effectively leveraging the rich knowledge and contextual understanding of large language models (LLMs) in automatic speech recognition (ASR) remains non-trivial, as t…