Bayesian Inverse Transition Learning: Learning Dynamics From Near-Optimal Trajectories
arXiv:2411.05174v2 Announce Type: replace
Abstract: We consider the problem of estimating the transition dynamics $T^*$ from near-optimal expert trajectories in the context of offline model-based reinforcement learning. We develop a novel constraint-b…