cs.AI

Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding

arXiv:2605.02290v1 Announce Type: new
Abstract: Distilling large reasoning models is essential for making long chain-of-thought (Long-CoT) reasoning practical, as full-scale inference remains computationally prohibitive. Existing curation-based approaches select complete rea…