cs.AI, cs.CC, cs.LG, stat.ML

Learning to Think from Multiple Thinkers

arXiv:2604.24737v1 Announce Type: cross
Abstract: We study learning with Chain-of-Thought (CoT) supervision from multiple thinkers, all of whom provide correct but possibly systematically different solutions, e.g., step-by-step solutions to math probl…