See, Remember, Explore: A Benchmark and Baselines for Streaming Spatial Reasoning
arXiv:2603.23864v1 Announce Type: new
Abstract: Spatial understanding is fundamental for embodied agents, yet most spatial VLMs and benchmarks remain offline-evaluating post-hoc QA over pre-recorded inputs and overlooking two crucial deployment-critic…