cs.AI, cs.LG

Unveiling m-Sharpness Through the Structure of Stochastic Gradient Noise

arXiv:2509.18001v5 Announce Type: replace
Abstract: Sharpness-aware minimization (SAM) has emerged as a highly effective technique to improve model generalization, but its underlying principles are not fully understood. We investigate m-sharpness, whe…