FrameVGGT: Geometry-Aligned Frame-Level Memory for Bounded Streaming VGGT
arXiv:2603.07690v2 Announce Type: replace
Abstract: Streaming Visual Geometry Transformers such as StreamVGGT enable strong online 3D perception, but their KV-cache grows unbounded over long streams, limiting practical deployment. We revisit bounded-m…