CRISP: Compressed Reasoning via Iterative Self-Policy Distillation
arXiv:2603.05433v5 Announce Type: replace
Abstract: Reasoning models think out loud, but much of what they say is noise. We introduce CRISP (Compressed Reasoning via Iterative Self-Policy Distillation), a method that teaches models to reason more conc…