cs.CL

ParlaSpeech 3.0: Richly Annotated Spoken Parliamentary Corpora of Croatian, Czech, Polish, and Serbian

arXiv:2511.01619v2 Announce Type: replace
Abstract: ParlaSpeech is a collection of spoken parliamentary corpora currently spanning four Slavic languages – Croatian, Czech, Polish and Serbian – totaling 6 thousand hours. The corpora were bu…