Beyond Majority Voting: Efficient Best-Of-N with Radial Consensus Score
arXiv:2604.12196v1
Abstract: Large language models (LLMs) frequently generate multiple candidate responses for a given prompt, yet selecting the most reliable one remains challenging, especially when correctness diverges from surfac…