cs.AI, cs.LG

Efficient LLM Reasoning via Variational Posterior Guidance with Efficiency Awareness

arXiv:2605.11019v1 Announce Type: new
Abstract: Although large language models rely on chain-of-thought for complex reasoning, the overthinking phenomenon severely degrades inference efficiency. Existing reinforcement learning methods compress reasoni…