Haoming Wang, Wei Gao

Uncovering and Shaping the Latent Representation of 3D Scene Topology in Vision-Language Models

Haoming Wang, Wei Gao / May 11, 2026

arXiv:2605.07148v1 Announce Type: new
Abstract: Decades of cognitive science establish that humans navigate environments by forming cognitive maps, defined as allocentric and topology-preserving representations of 3D space. While modern Vision-Languag…

Author name: Haoming Wang, Wei Gao

Uncovering and Shaping the Latent Representation of 3D Scene Topology in Vision-Language Models