SMIXAE: Towards Unsupervised Manifold Discovery in Language Models
arXiv:2605.09224v1 Announce Type: new
Abstract: Sparse autoencoders (SAEs) have been used widely to decompose and interpret neural network activations, especially those of transformer language models. One key issue with SAEs is their inability to dire…