Zhuohao Yu, Zhiwei Steven Wu, Adam Block

From Curiosity to Caution: Mitigating Reward Hacking for Best-of-N with Pessimism

Zhuohao Yu, Zhiwei Steven Wu, Adam Block / April 7, 2026

arXiv:2604.04648v1 Announce Type: new
Abstract: Inference-time compute scaling has emerged as a powerful paradigm for improving language model performance on a wide range of tasks, but the question of how best to use the additional compute remains ope…

Author name: Zhuohao Yu, Zhiwei Steven Wu, Adam Block

From Curiosity to Caution: Mitigating Reward Hacking for Best-of-N with Pessimism