cs.LG

Practical Scaling Laws: Converting Compute into Performance in a Data-Constrained World

arXiv:2605.09189v1 Announce Type: new
Abstract: The scaling laws guiding modern model training were calibrated for a single regime: data-rich, single-epoch pretraining. The dominant such scaling law form, Chinchilla’s $L = E + A/N^\alpha + B/D^\beta$,…