Julia Hu, Alfred Shen, Kumar Lakshmipathi

Statistical Scouting Finds Debate-Safe but Not Debate-Useful Cases: A Matched-Ceiling Study of Open-Weight LLM Reasoning Protocols

Julia Hu, Alfred Shen, Kumar Lakshmipathi / May 12, 2026

arXiv:2605.09618v1 Announce Type: new
Abstract: When should a language model answer directly, sample and vote, or engage in multi-agent debate? Recent work shows voting often explains much of the gain attributed to debate, while selective-debate syste…

Author name: Julia Hu, Alfred Shen, Kumar Lakshmipathi

Statistical Scouting Finds Debate-Safe but Not Debate-Useful Cases: A Matched-Ceiling Study of Open-Weight LLM Reasoning Protocols