Leo Benac, Abhishek Sharma, Sonali Parbhoo, Finale Doshi-Velez

Bayesian Inverse Transition Learning: Learning Dynamics From Near-Optimal Trajectories

Leo Benac, Abhishek Sharma, Sonali Parbhoo, Finale Doshi-Velez / April 29, 2026

arXiv:2411.05174v2 Announce Type: replace
Abstract: We consider the problem of estimating the transition dynamics $T^*$ from near-optimal expert trajectories in the context of offline model-based reinforcement learning. We develop a novel constraint-b…

Author name: Leo Benac, Abhishek Sharma, Sonali Parbhoo, Finale Doshi-Velez

Bayesian Inverse Transition Learning: Learning Dynamics From Near-Optimal Trajectories