cs.LG

Interactive Inverse Reinforcement Learning of Interaction Scenarios via Bi-level Optimization

arXiv:2605.08131v1 Announce Type: new
Abstract: Inverse reinforcement learning (IRL) learns a reward function and a corresponding policy that best fit the demonstration data of an expert. However, in the current IRL setting, the learner is isolated fr…