CITE: Anytime-Valid Statistical Inference in LLM Self-Consistency
arXiv:2605.05873v1 Announce Type: new
Abstract: Large language models often improve reasoning by sampling multiple outputs and aggregating their final answers, but precise and efficient control of error levels remains a challenging task. In particular…