STU-PID: Steering Token Usage via PID Controller for Efficient Large Language Model Reasoning
arXiv:2506.18831v2 Announce Type: replace
Abstract: Large Language Models employing extended chain-of-thought (CoT) reasoning often suffer from the overthinking phenomenon, generating excessive and redundant reasoning steps that increase computational…