ROM: Real-time Overthinking Mitigation via Streaming Detection and Intervention
arXiv:2603.22016v2 Announce Type: replace-cross
Abstract: Large Reasoning Models (LRMs) often reach a correct solution before their long Chain-of-Thought trace ends, yet continue with redundant verification, repeated attempts, or unnecessary explorati…