cs.LG, math.OC

A Non-Monotone Preconditioned Trust-Region Method for Neural Network Training

arXiv:2605.14860v1 Announce Type: cross
Abstract: Training deep neural networks at scale can benefit from domain decomposition, where the network is split into subdomains trained in parallel and coupled by a global trust-region mechanism. Building on …