cs.CL, cs.LG

Learning a Continue-Thinking Token for Enhanced Test-Time Scaling

arXiv:2506.11274v2 Announce Type: replace-cross
Abstract: Test-time scaling has emerged as an effective approach for improving language model performance by utilizing additional compute at inference time. Recent studies have shown that overriding end-…