The Convergence Gap: Instruction-Tuned Language Models Stabilize Later in the Forward Pass
arXiv:2605.07282v1 Announce Type: new
Abstract: Final outputs hide when a checkpoint commits to its next-token prediction. We introduce the convergence gap, a model-diffing diagnostic that decodes each layer’s next-token distribution and measures its …