cs.LG

AGOP as Explanation: From Feature Learning to Per-Sample Attribution in Image Classifiers

arXiv:2605.12816v1 Announce Type: new
Abstract: The Average Gradient Outer Product (AGOP) governs feature learning in neural networks: the Neural Feature Ansatz states that weight Gram matrices at each layer align with the corresponding AGOP matrices …