Learning When to Stop: Selective Imitation Learning Under Arbitrary Dynamics Shift
arXiv:2605.09183v1 Announce Type: new
Abstract: Behavior cloning provides strong imitation learning guarantees when training and test environments share the same dynamics. However, in many deployment settings the test environment’s transitions differ …