Reliable Chain-of-Thought via Prefix Consistency
arXiv:2605.07654v1 Announce Type: cross
Abstract: Large Language Models often improve accuracy on reasoning tasks by sampling multiple Chain-of-Thought (CoT) traces and aggregating them with majority voting (MV), a test-time technique called self-cons…