cs.LG

From Curiosity to Caution: Mitigating Reward Hacking for Best-of-N with Pessimism

arXiv:2604.04648v1 Announce Type: new
Abstract: Inference-time compute scaling has emerged as a powerful paradigm for improving language model performance on a wide range of tasks, but the question of how best to use the additional compute remains ope…