Accelerating LMO-Based Optimization via Implicit Gradient Transport
arXiv:2605.05577v1 Announce Type: new
Abstract: Recent optimizers such as Lion and Muon have demonstrated strong empirical performance by normalizing gradient momentum via linear minimization oracles (LMOs). While variance reduction has been explored …
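For readers unfamiliar with the term, the following is a minimal sketch of what "normalizing gradient momentum via an LMO" can look like; it is not the method proposed in this paper, and it contains no variance reduction or gradient transport. It assumes the LMO is taken over a unit norm ball: for the l-infinity ball the minimizer is the negated sign of the momentum (a Lion-like direction), and for the spectral-norm ball it is the negated orthogonal polar factor (a Muon-like direction). The function names, the single-`beta` momentum rule, and the use of an exact SVD are illustrative assumptions; Lion as published uses two interpolation coefficients, and Muon approximates the orthogonalization with Newton-Schulz iterations.

```python
import numpy as np


def lmo_linf(m, radius=1.0):
    """LMO over the l-infinity ball: argmin_{||x||_inf <= radius} <m, x>.

    The minimizer is -radius * sign(m) (sign-based, Lion-like direction).
    """
    return -radius * np.sign(m)


def lmo_spectral(m, radius=1.0):
    """LMO over the spectral-norm ball: argmin_{||X||_2 <= radius} <M, X>.

    With M = U S V^T, the minimizer is -radius * U V^T (Muon-like direction).
    An exact SVD is used here for clarity only (illustrative assumption).
    """
    u, _, vt = np.linalg.svd(m, full_matrices=False)
    return -radius * (u @ vt)


def lmo_momentum_step(x, m, grad, lmo, lr=0.1, beta=0.9):
    """One hypothetical LMO-based step on an exponential moving average of gradients."""
    m = beta * m + (1.0 - beta) * grad  # update the momentum buffer
    x = x + lr * lmo(m)                 # move along the LMO direction
    return x, m


if __name__ == "__main__":
    x, m = np.zeros(2), np.zeros(2)
    x, m = lmo_momentum_step(x, m, np.array([1.0, -2.0]), lmo_linf)
    print(x)  # each coordinate moves by exactly lr, whatever the gradient scale
```

Because the LMO output depends only on the direction structure of the momentum, not its magnitude, the step size is set entirely by `lr` and the ball radius.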