cs.LG

Blazing the trails before beating the path: Sample-efficient Monte-Carlo planning

arXiv:2604.14974v1 Announce Type: new
Abstract: You are a robot and you live in a Markov decision process (MDP) with a finite or an infinite number of transitions from state-action to next states. You got brains and so you plan before you act. Luckily…