Chenghao Sun, Chengsheng Zhang, Guanzheng Qin, Rui Dai, Xinmei Tian

Weight Patching: Toward Source-Level Mechanistic Localization in LLMs

Chenghao Sun, Chengsheng Zhang, Guanzheng Qin, Rui Dai, Xinmei Tian / April 17, 2026

arXiv:2604.13694v1 Announce Type: new
Abstract: Mechanistic interpretability seeks to localize model behavior to the internal components that causally realize it. Prior work has advanced activation-space localization and causal tracing, but modules th…

Author name: Chenghao Sun, Chengsheng Zhang, Guanzheng Qin, Rui Dai, Xinmei Tian

Weight Patching: Toward Source-Level Mechanistic Localization in LLMs