cs.CV

SeeU: Seeing the Unseen World via 4D Dynamics-aware Generation

arXiv:2512.03350v2 Announce Type: replace
Abstract: Images and videos are discrete 2D projections of the 4D world (3D space + time). Most visual understanding, prediction, and generation operate directly on 2D observations, leading to suboptimal perfo…