cs.CL

StableToken: A Noise-Robust Semantic Speech Tokenizer for Resilient SpeechLLMs

arXiv:2509.22220v2 Announce Type: replace
Abstract: Prevalent semantic speech tokenizers, designed to capture linguistic content, are surprisingly fragile. We find they are not robust to meaning-irrelevant acoustic perturbations; even at high Signal-t…