cs.CV

A Scene is Worth a Thousand Features: Feed-Forward Camera Localization from a Collection of Image Features

arXiv:2510.00978v2 Announce Type: replace
Abstract: Visually localizing an image, i.e., estimating its camera pose, requires building a scene representation that serves as a visual map. The representation we choose has direct consequences for the …