cs.CL, cs.LG

Self-Calibrating Language Models via Test-Time Discriminative Distillation

arXiv:2604.09624v1 Announce Type: new
Abstract: Large language models (LLMs) are systematically overconfident: they routinely express high certainty on questions they often answer incorrectly. Existing calibration methods either require labeled valida…