A Layer Separation Optimization Framework for Cross-Entropy Training in Deep Learning
arXiv:2604.23225v1 Announce Type: new
Abstract: This paper investigates the deep learning optimization problem with softmax cross-entropy loss. We propose a layer separation strategy to alleviate the strong nonconvexity encountered during training dee…
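For reference, the softmax cross-entropy loss named in the abstract can be sketched as below. This is only the standard textbook definition of the loss for a single example, written in plain Python. It is not the paper's layer separation strategy, which the truncated abstract does not describe. The function names `softmax` and `softmax_cross_entropy` and the sample logits are illustrative assumptions, not taken from the paper.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats (illustrative helper)."""
    m = max(logits)  # subtract the max so exp() cannot overflow
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def softmax_cross_entropy(logits, target):
    """Cross-entropy of softmax(logits) against a single integer class label.

    Computed as log-sum-exp(logits) - logits[target], which equals
    -log(softmax(logits)[target]) but avoids taking the log of a tiny number.
    """
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[target]


# Hypothetical example: three-class logits with true class 0.
logits = [2.0, 1.0, 0.1]
loss = softmax_cross_entropy(logits, target=0)
print(f"loss = {loss:.4f}")
```

In a network, `logits` would be the output of the last layer, so the loss is a nonconvex function of the weights in all earlier layers even though it is convex in the logits themselves.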