Antonis Asonitis, Luca A. Lanzendörfer, Frédéric Berdoz, Roger Wattenhofer

WorldSpeech: A Multilingual Speech Corpus from Around the World

Antonis Asonitis, Luca A. Lanzendörfer, Frédéric Berdoz, Roger Wattenhofer / May 12, 2026

arXiv:2605.09167v1 Announce Type: cross
Abstract: Automatic speech recognition (ASR) performs well for high-resource languages with abundant paired audio-transcript data, but its accuracy degrades sharply for most languages due to limited publicly ava…
