Towards High-Consistency Embodied World Model with Multi-View Trajectory Videos
arXiv:2511.12882v3 Announce Type: replace-cross
Abstract: Embodied world models aim to predict and interact with the physical world through visual observations and actions. However, existing models struggle to accurately translate low-level actions (e…