cs.LG, stat.ML

Closed-Form Last Layer Optimization

arXiv:2510.04606v2 Announce Type: replace
Abstract: Neural networks are typically optimized with variants of stochastic gradient descent. Under a squared loss, however, the optimal solution to the linear last layer weights is known in closed-form. We …