cs.AI, cs.LG

The Signal is in the Steps: Local Scoring for Reasoning Data Selection

arXiv:2510.03988v2 Announce Type: replace-cross
Abstract: Distilling long-form reasoning from teacher models into smaller students requires selecting which candidate solutions to train on. Recent work argues that one should select responses the studen…