Qinhao Chen, Linyang He, Nima Mesgarani

Prune, Interpret, Evaluate: A Cross-Layer Transcoder-Native Framework for Efficient Circuit Discovery via Feature Attribution

Qinhao Chen, Linyang He, Nima Mesgarani / April 21, 2026

arXiv:2604.16889v1 Announce Type: new
Abstract: Existing feature-interpretation pipelines typically operate on uniformly sampled units, but only a small fraction of cross-layer transcoder (CLT) features matter for a target behavior, with the rest resu…
