cs.AI, cs.LG

Variance-Aware Prior-Based Tree Policies for Monte Carlo Tree Search

arXiv:2512.21648v2 Announce Type: replace
Abstract: Monte Carlo Tree Search (MCTS) has profoundly influenced reinforcement learning (RL) by integrating planning and learning in tasks requiring long-horizon reasoning, exemplified by the AlphaZero famil…