SoftSAE: Dynamic Top-K Selection for Adaptive Sparse Autoencoders
arXiv:2605.06610v2 Announce Type: replace-cross
Abstract: Sparse Autoencoders (SAEs) have become an important tool in mechanistic interpretability, helping to analyze internal representations in both Large Language Models (LLMs) and Vision Transformer…