Generalization performance of narrow one-hidden layer networks in the teacher-student setting
arXiv:2507.00629v4
Abstract: Understanding the generalization properties of neural networks on simple input-output distributions is key to explaining their performance on real datasets. The classical teacher-student settin…