Generalizable Dense Reward for Long-Horizon Robotic Tasks
arXiv:2604.00055v1 Announce Type: cross
Abstract: Existing robotic foundation policies are trained primarily via large-scale imitation learning. While such models demonstrate strong capabilities, they often struggle with long-horizon tasks due to dist…