Turning Generators into Retrievers: Unlocking MLLMs for Natural Language-Guided Geo-Localization
arXiv:2604.10721v1 Announce Type: new
Abstract: Natural-language Guided Cross-view Geo-localization (NGCG) aims to retrieve geo-tagged satellite imagery using textual descriptions of ground scenes. While recent NGCG methods commonly rely on CLIP-style…