cs.AI, cs.CL

Learning from Contrasts: Synthesizing Reasoning Paths from Diverse Search Trajectories

arXiv:2604.11365v1 Announce Type: cross
Abstract: Monte Carlo Tree Search (MCTS) has been widely used for automated reasoning data exploration, but current supervision extraction methods remain inefficient. Standard approaches retain only the single h…