cond-mat.dis-nn, cond-mat.stat-mech, stat.ML

Escape dynamics and implicit bias of one-pass SGD in overparameterized quadratic networks

arXiv:2604.03068v1 Announce Type: cross
Abstract: We analyze the one-pass stochastic gradient descent dynamics of a two-layer neural network with quadratic activations in a teacher–student framework. In the high-dimensional regime, where the input di…