cs.CV

Where Do Vision-Language Models Fail? World Scale Analysis for Image Geolocalization

arXiv:2604.16248v1 Announce Type: new
Abstract: Image geolocalization has traditionally been addressed through retrieval-based place recognition or geometry-based visual localization pipelines. Recent advances in Vision-Language Models (VLMs) have dem…