Snehit Vaddi, Pujith Vaddi

Do Hallucination Neurons Generalize? Evidence from Cross-Domain Transfer in LLMs

April 23, 2026

arXiv:2604.19765v1
Abstract: Recent work identifies a sparse set of “hallucination neurons” (H-neurons), less than 0.1% of feed-forward network neurons, that reliably predict when large language models will hallucinate. These neuron…
