Vishal Pramanik, Maisha Maliha, Nathaniel D. Bastian, Sumit Kumar Jha

Hessian-Enhanced Token Attribution (HETA): Interpreting Autoregressive LLMs

Vishal Pramanik, Maisha Maliha, Nathaniel D. Bastian, Sumit Kumar Jha / April 16, 2026

arXiv:2604.13258v1 Announce Type: cross
Abstract: Attribution methods seek to explain language model predictions by quantifying the contribution of input tokens to generated outputs. However, most existing techniques are designed for encoder-based arc…

Author name: Vishal Pramanik, Maisha Maliha, Nathaniel D. Bastian, Sumit Kumar Jha

Hessian-Enhanced Token Attribution (HETA): Interpreting Autoregressive LLMs