Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling
arXiv:2604.27039v1 Announce Type: new
Abstract: The token serves as the fundamental unit of computation in modern autoregressive models, and generation length directly influences both inference cost and reasoning performance. Despite its importance, exist…