cs.AI, cs.CV, cs.LG

Streaming 4D Visual Geometry Transformer

arXiv:2507.11539v2 Announce Type: replace
Abstract: Perceiving and reconstructing 3D geometry from videos is a fundamental yet challenging computer vision task. To facilitate interactive and low-latency applications, we propose a streaming visual geom…