Slower Generalization, Faster Memorization: A Sweet Spot in Algorithmic Learning
arXiv:2605.14659v1 Announce Type: new
Abstract: Critical-data-size accounts of grokking suggest a natural post-threshold intuition: once training data is sufficient to identify the underlying rule, additional data should accelerate validation converge…