WorldComp2D: Spatio-semantic Representations of Object Identity and Location from Local Views
arXiv:2605.11743v1 Announce Type: new
Abstract: Learning latent representations that capture both semantic and spatial information is central to efficient spatio-semantic reasoning. However, many existing approaches rely on implicit latent structures …