Escape dynamics and implicit bias of one-pass SGD in overparameterized quadratic networks
arXiv:2604.03068v1 Announce Type: cross
Abstract: We analyze the one-pass stochastic gradient descent dynamics of a two-layer neural network with quadratic activations in a teacher–student framework. In the high-dimensional regime, where the input di…
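As a rough illustration of the setting named in the abstract, the following is a minimal sketch of one-pass SGD for a two-layer student network with quadratic activations, trained on labels from a narrower teacher of the same form. Everything beyond "two-layer, quadratic activation, teacher-student, one-pass SGD" is an assumption made here for illustration and is not taken from the paper: the standard Gaussian inputs, the 1/m output normalization, the squared loss, the 1/sqrt(d) initialization, and the hypothetical names and default values (`quadratic_net`, `grad_loss`, `one_pass_sgd`, `lr`, `steps`, the widths).

```python
import numpy as np


def quadratic_net(W, x):
    """Two-layer net with quadratic activation: mean over units of (w_j . x)^2.

    The 1/m normalization (via the mean) is an assumption of this sketch.
    """
    return np.mean((W @ x) ** 2)


def loss(W, x, y):
    """Squared loss on a single sample (assumed loss, not from the source)."""
    return 0.5 * (quadratic_net(W, x) - y) ** 2


def grad_loss(W, x, y):
    """Gradient of the single-sample squared loss with respect to W."""
    pre = W @ x                              # pre-activations, shape (m,)
    err = np.mean(pre ** 2) - y              # prediction error
    # d/dW of mean_j (w_j . x)^2 is (2/m) * outer(pre, x)
    return err * (2.0 / W.shape[0]) * np.outer(pre, x)


def one_pass_sgd(d=20, m_teacher=1, m_student=5, lr=0.005, steps=2000, seed=0):
    """Run one-pass SGD: every step draws a fresh sample that is never reused."""
    rng = np.random.default_rng(seed)
    # Teacher and student weights; the 1/sqrt(d) scale is an assumption.
    W_star = rng.standard_normal((m_teacher, d)) / np.sqrt(d)
    W = rng.standard_normal((m_student, d)) / np.sqrt(d)
    for _ in range(steps):
        x = rng.standard_normal(d)           # fresh Gaussian input (assumed)
        y = quadratic_net(W_star, x)         # noiseless teacher label (assumed)
        W -= lr * grad_loss(W, x, y)         # single-sample SGD update
    return W, W_star


if __name__ == "__main__":
    W, W_star = one_pass_sgd()
    # Monte Carlo estimate of the population risk on held-out samples.
    rng = np.random.default_rng(123)
    X_test = rng.standard_normal((2000, W.shape[1]))
    risk = np.mean([loss(W, x, quadratic_net(W_star, x)) for x in X_test])
    print("estimated population risk:", risk)
```

Because the student is wider than the teacher (`m_student > m_teacher`), many weight configurations fit the teacher equally well; which one the dynamics select is the kind of question the title refers to as implicit bias. The sketch only sets up the dynamics and makes no claim about the paper's results.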