On Training in Imagination
arXiv:2605.06732v2 Announce Type: replace
Abstract: State-of-the-art model-based reinforcement learning methods train policies on imagined rollouts. These rollouts are trajectories generated by a learned dynamics model and are scored by a learned rewa…