Retrieval-of-Thought: Efficient Reasoning via Reusing Thoughts
arXiv:2509.21743v2 Announce Type: replace
Abstract: Large reasoning models improve accuracy by producing long reasoning traces, but these traces inflate latency and cost, which motivates more efficient inference. We propose Retrieval-of-Thought (RoT), which reus…