Thomas Zollo, Jimmy Wang, Richard Zemel

Unsupervised Confidence Calibration for Reasoning LLMs from a Single Generation

Thomas Zollo, Jimmy Wang, Richard Zemel / April 22, 2026

arXiv:2604.19444v1 Announce Type: new
Abstract: Reasoning language models can solve increasingly complex tasks, but struggle to produce the calibrated confidence estimates necessary for reliable deployment. Existing calibration methods usually depend …

Author name: Thomas Zollo, Jimmy Wang, Richard Zemel

Unsupervised Confidence Calibration for Reasoning LLMs from a Single Generation