Sign-Based Optimizers Are Effective Under Heavy-Tailed Noise
arXiv:2602.07425v2 Announce Type: replace-cross
Abstract: While adaptive gradient methods are the workhorse of modern machine learning, sign-based optimization algorithms such as Lion and Muon have recently demonstrated superior empirical performance …