Variance-Aware Prior-Based Tree Policies for Monte Carlo Tree Search
arXiv:2512.21648v2 Announce Type: replace
Abstract: Monte Carlo Tree Search (MCTS) has profoundly influenced reinforcement learning (RL) by integrating planning and learning in tasks requiring long-horizon reasoning, exemplified by the AlphaZero famil…