Are Flat Minima an Illusion?
arXiv:2605.05209v1 Announce Type: cross
Abstract: Neural networks that land in flat regions of the loss landscape tend to generalise better than those in sharp regions. Sharpness-Aware Minimisation exploits this to improve generalisation. But function…