Decoupled Descent: Exact Test Error Tracking Via Approximate Message Passing
arXiv:2604.27883v1 Announce Type: cross
Abstract: In modern parametric model training, full-batch gradient descent (and its variants) suffers due to progressively stronger biasing towards the exact realization of training data; this drives the systema…