cs.AI, cs.LG

WIMLE: Uncertainty-Aware World Models with IMLE for Sample-Efficient Continuous Control

arXiv:2602.14351v2 Announce Type: replace-cross
Abstract: Model-based reinforcement learning promises strong sample efficiency but often underperforms in practice due to compounding model error, unimodal world models that average over multi-modal dyna…