cs.CL

Retrofitting Small Multilingual Models for Retrieval: Matching 7B Performance with 300M Parameters

arXiv:2510.14274v2 Announce Type: replace
Abstract: Training effective multilingual embedding models presents unique challenges due to the diversity of languages and task objectives. Although small multilingual models (1 B) in the most prevalent use c…