Detecting overfitting in Neural Networks during long-horizon grokking using Random Matrix Theory
arXiv:2605.12394v2 Announce Type: replace
Abstract: Training Neural Networks (NNs) without overfitting is difficult; detecting that overfitting is difficult as well. We present a novel Random Matrix Theory method that detects the onset of overfitting …