Task-Aware Calibration: Provably Optimal Decoding in LLMs
arXiv:2605.10202v1 Announce Type: cross
Abstract: LLM decoding often relies on the model’s predictive distribution to generate an output. Consequently, misalignment with respect to the true generating distribution leads to suboptimal decisions in prac…