Inferring Compositional 4D Scenes without Ever Seeing One
arXiv:2512.05272v2 Announce Type: replace
Abstract: Scenes in the real world are often composed of several static and dynamic objects. Capturing their 4-dimensional structures, composition, and spatio-temporal configuration in the wild, though extremel…