cs.CL

CRISP: Compressing Redundancy in Chain-of-Thought via Intrinsic Saliency Pruning

arXiv:2604.17297v1 Announce Type: new
Abstract: Long Chain-of-Thought (CoT) reasoning is pivotal for the success of recent reasoning models but suffers from high computational overhead and latency. While prior works attempt to compress CoT via externa…