Jean-Bastien Grill, Michal Valko, Rémi Munos

Blazing the trails before beating the path: Sample-efficient Monte-Carlo planning

Jean-Bastien Grill, Michal Valko, Rémi Munos / April 17, 2026

arXiv:2604.14974v1 Announce Type: new
Abstract: You are a robot and you live in a Markov decision process (MDP) with a finite or an infinite number of transitions from state-action to next states. You got brains and so you plan before you act. Luckily…
