cs.CL

Cross-lingual Matryoshka Representation Learning across Speech and Text

arXiv:2602.19991v2 Announce Type: replace
Abstract: Speakers of under-represented languages face both a language barrier, as most online knowledge is in a few dominant languages, and a modality barrier, since information is largely text-based while ma…