Nahyun Lee, Guijin Son

Pushing the Boundaries of Multiple Choice Evaluation to One Hundred Options

Nahyun Lee, Guijin Son / April 17, 2026

arXiv:2604.14634v1 Announce Type: new
Abstract: Multiple choice evaluation is widely used for benchmarking large language models, yet near ceiling accuracy in low option settings can be sustained by shortcut strategies that obscure true competence. Th…

Author name: Nahyun Lee, Guijin Son

Pushing the Boundaries of Multiple Choice Evaluation to One Hundred Options