cs.AI

Reasoning Models Can be Accurately Pruned Via Chain-of-Thought Reconstruction

arXiv:2509.12464v2 Announce Type: replace
Abstract: Reasoning language models such as DeepSeek-R1 produce long chain-of-thought traces during inference time which make them costly to deploy at scale. We show that using compression techniques such as n…