WorldSpeech: A Multilingual Speech Corpus from Around the World
arXiv:2605.09167v1 Announce Type: cross
Abstract: Automatic speech recognition (ASR) performs well for high-resource languages with abundant paired audio-transcript data, but its accuracy degrades sharply for most languages due to limited publicly ava…