cs.LG

Neural Garbage Collection: Learning to Forget while Learning to Reason

arXiv:2604.18002v1 Announce Type: new
Abstract: Chain-of-thought reasoning has driven striking advances in language model capability, yet every reasoning step grows the KV cache, creating a bottleneck to scaling this paradigm further. Current approach…