OmniUMI: Towards Physically Grounded Robot Learning via Human-Aligned Multimodal Interaction
arXiv:2604.10647v3 Announce Type: replace
Abstract: UMI-style interfaces enable scalable robot learning, but existing systems remain largely visuomotor, relying primarily on RGB observations and trajectory data while providing only limited access to physic…