The Detection–Extraction Gap: Models Know the Answer Before They Can Say It
arXiv:2604.06613v1 Announce Type: cross
Abstract: Modern reasoning models continue generating long after the answer is already determined. Across five model configurations, two families, and three benchmarks, we find that 52–88% of chain-of-…