RISE: Self-Improving Robot Policy with Compositional World Model
arXiv:2602.11075v2 Announce Type: replace
Abstract: Despite the sustained scaling on model capacity and data acquisition, Vision-Language-Action (VLA) models remain brittle in contact-rich and dynamic manipulation tasks, where minor execution deviatio…