cs.AI

Pruning Long Chain-of-Thought of Large Reasoning Models via Small-Scale Preference Optimization

arXiv:2508.10164v2 Announce Type: replace
Abstract: Recent advances in Large Reasoning Models (LRMs) have demonstrated strong performance on complex tasks through long Chain-of-Thought (CoT) reasoning. However, their lengthy outputs increase computati…