Unified 4D World Action Modeling from Video Priors with Asynchronous Denoising
arXiv:2604.26694v1 Announce Type: cross
Abstract: We propose X-WAM, a Unified 4D World Model that unifies real-time robotic action execution and high-fidelity 4D world synthesis (video + 3D reconstruction) in a single framework, addressing the critica…