SARM: Stage-Aware Reward Modeling for Long Horizon Robot Manipulation
arXiv:2509.25358v4 Announce Type: replace
Abstract: Large-scale robot learning has made progress on complex manipulation tasks, yet long-horizon, contact-rich problems, especially those involving deformable objects, remain challenging due to inconsist…