PLOT: Progressive Localization via Optimal Transport in Neural Causal Abstraction
arXiv:2605.06979v1 Announce Type: cross
Abstract: Causal abstraction offers a principled framework for mechanistic interpretability, aligning a high-level causal model with the low-level computation realized by a neural network through counterfactual …