Jean-Bastien Grill, Omar Darwiche Domingues, Pierre M\'enard, R\'emi Munos, Michal Valko

Planning in entropy-regularized Markov decision processes and games

Jean-Bastien Grill, Omar Darwiche Domingues, Pierre M\'enard, R\'emi Munos, Michal Valko / April 22, 2026

arXiv:2604.19695v1 Announce Type: new
Abstract: We propose SmoothCruiser, a new planning algorithm for estimating the value function in entropy-regularized Markov decision processes and two-player games, given a generative model of the environment. Sm…

Author name: Jean-Bastien Grill, Omar Darwiche Domingues, Pierre M\'enard, R\'emi Munos, Michal Valko

Planning in entropy-regularized Markov decision processes and games