Atahan Dokme, Sriram Vishwanath

Interpreting Video Representations with Spatio-Temporal Sparse Autoencoders

Atahan Dokme, Sriram Vishwanath / April 7, 2026

arXiv:2604.03919v1 Announce Type: new
Abstract: We present the first systematic study of Sparse Autoencoders (SAEs) on video representations. Standard SAEs decompose video into interpretable, monosemantic features but destroy temporal coherence: hard …

Author name: Atahan Dokme, Sriram Vishwanath

Interpreting Video Representations with Spatio-Temporal Sparse Autoencoders