cs.LG

Optimal Rates for Generalization of Gradient Descent for Deep ReLU Classification

arXiv:2510.02779v3 Announce Type: replace
Abstract: Recent advances have significantly improved our understanding of the generalization performance of gradient descent (GD) methods in deep neural networks. A natural and fundamental question is whether…