cs.AI, cs.LG, math.ST, stat.ME, stat.ML, stat.TH

CITE: Anytime-Valid Statistical Inference in LLM Self-Consistency

arXiv:2605.05873v1 Announce Type: new
Abstract: Large language models often improve reasoning by sampling multiple outputs and aggregating their final answers, but precise and efficient control of error levels remains a challenging task. In particular…