cs.AI, cs.CV, cs.LG, cs.NE

It’s Not a Lottery, It’s a Race: Understanding How Gradient Descent Adapts the Network’s Capacity to the Task

arXiv:2602.04832v2 Announce Type: replace
Abstract: Our theoretical understanding of neural networks lags behind their empirical success. One important unexplained phenomenon is why and how, during training with gradient des…