Language Models Can Explain Visual Features via Steering
arXiv:2603.22593v2 Announce Type: replace-cross
Abstract: Sparse Autoencoders uncover thousands of features in vision models, yet explaining these features without requiring human intervention remains an open challenge. While previous work has propose…