MotuBrain: An Advanced World Action Model for Robot Control
arXiv:2604.27792v2 Announce Type: replace
Abstract: Vision-Language-Action (VLA) models generalize semantically well but often lack fine-grained modeling of world dynamics. We present MotuBrain, a unified World Action Model that jointly models video a…