Learning a Continue-Thinking Token for Enhanced Test-Time Scaling
arXiv:2506.11274v2 Announce Type: replace-cross
Abstract: Test-time scaling has emerged as an effective approach for improving language model performance by utilizing additional compute at inference time. Recent studies have shown that overriding end-…