Spectral Lens: Activation and Gradient Spectra as Diagnostics of LLM Optimization
arXiv:2605.05683v1 Announce Type: new
Abstract: Training loss and throughput can hide distinct internal representations in language-model training. To examine these hidden mechanics, we use spectral measurements as practical and operational diagnostics…