cs.RO

From Action Labels to Sets: Rethinking Action Supervision for Imitation Learning from Corrective Feedback

arXiv:2502.07645v3 Announce Type: replace
Abstract: Behavior cloning (BC) optimizes policies by treating human demonstrations as pointwise action labels. While effective with accurate action labels, this formulation is brittle in practice: when human-…