E-TCAV: Formalizing Penultimate Proxies for Efficient Concept Based Interpretability
arXiv:2605.10261v1 Announce Type: new
Abstract: TCAV (Testing with Concept Activation Vectors) is an interpretability method that assesses the alignment between the internal representations of a trained neural network and human-understandable, high-le…