FreeRet: MLLMs as Training-Free Retrievers
arXiv:2509.24621v2 Announce Type: replace
Abstract: Multimodal large language models (MLLMs) are emerging as versatile foundations for mixed-modality retrieval. Yet, they often require heavy post-hoc training to convert them into contrastive encoders …