cs.LG

On the Convergence Behavior of Preconditioned Gradient Descent Toward the Rich Learning Regime

arXiv:2601.03162v2 Announce Type: replace
Abstract: Spectral bias, the tendency of neural networks to learn low frequencies first, can be both a blessing and a curse. While it enhances the generalization capabilities by suppressing high-frequency nois…