Closing the Modality Reasoning Gap for Speech Large Language Models
arXiv:2601.05543v2 Announce Type: replace
Abstract: Although Speech Large Language Models have achieved notable progress, a substantial modality reasoning gap remains: their reasoning performance on speech inputs is markedly weaker than on text. This …