A Multi-Level Causal Intervention Framework for Mechanistic Interpretability in Variational Autoencoders
arXiv:2505.03530v3 Announce Type: replace
Abstract: Understanding how generative models represent and transform data is a foundational problem in deep learning interpretability. While mechanistic interpretability of discriminative architectures has yi…