cs.LG, stat.ML

Guided Speculative Inference for Efficient Test-Time Alignment of LLMs

arXiv:2506.04118v3 Announce Type: replace-cross
Abstract: We propose Guided Speculative Inference (GSI), a novel algorithm for efficient reward-guided decoding in large language models. GSI combines soft best-of-$n$ test-time scaling with a reward mod…