cs.AI, cs.CL, cs.ET

Zero-Shot Confidence Estimation for Small LLMs: When Supervised Baselines Aren’t Worth Training

arXiv:2605.02241v1 Announce Type: cross
Abstract: How reliably can a small language model estimate its own correctness? The answer determines whether local-to-cloud routing (escalating queries a cheap local model cannot handle) can work without supervis…