cs.AI, cs.LG, stat.ME, stat.ML

When Should an AI Workflow Release? Always-Valid Inference for Black-Box Generate-Verify Systems

arXiv:2605.12947v1 Announce Type: new
Abstract: LLM-enabled AI workflows increasingly produce outputs through iterative generate-evaluate-revise loops. Each iteration can improve the candidate, but it also creates a release decision: when to stop and …