cs.AI, cs.LG

Neuron-Anchored Rule Extraction for Large Language Models via Contrastive Hierarchical Ablation

arXiv:2605.03058v1 Announce Type: cross
Abstract: A key goal of explainable AI (XAI) is to express the decision logic of large language models (LLMs) in symbolic form and link it to internal mechanisms. Global rule-extraction methods typically learn s…