Bilinear autoencoders find interpretable manifolds
arXiv:2605.08891v1 Announce Type: new
Abstract: Sparse autoencoders have become a standard tool for uncovering interpretable latent representations in neural networks. Yet salient concepts often span manifolds that current linear methods cannot captur…