cs.CL

Shorthand for Thought: Compressing LLM Reasoning via Entropy-Guided Supertokens

arXiv:2604.26355v1 Announce Type: new
Abstract: Reasoning in Large Language Models incurs significant inference-time compute, yet the token-level information structure of reasoning traces remains underexplored. We observe that reasoning tokens split i…