Gabriel Peyr\'e - Provide.ai

Muon Dynamics as a Spectral Wasserstein Flow

Gabriel Peyr\'e / April 7, 2026

arXiv:2604.04891v1 Announce Type: cross
Abstract: Gradient normalization is central in deep-learning optimization because it stabilizes training and reduces sensitivity to scale. For deep architectures, parameters are naturally grouped into matrices o…

Author name: Gabriel Peyr\'e

Muon Dynamics as a Spectral Wasserstein Flow