Het Patel, Tiejin Chen, Hua Wei, Evangelos E. Papalexakis, Jia Chen

Are LLM Uncertainty and Correctness Encoded by the Same Features? A Functional Dissociation via Sparse Autoencoders

Het Patel, Tiejin Chen, Hua Wei, Evangelos E. Papalexakis, Jia Chen / April 23, 2026

arXiv:2604.19974v1 Announce Type: new
Abstract: Large language models can be uncertain yet correct, or confident yet wrong, raising the question of whether their output-level uncertainty and their actual correctness are driven by the same internal mec…

Author name: Het Patel, Tiejin Chen, Hua Wei, Evangelos E. Papalexakis, Jia Chen

Are LLM Uncertainty and Correctness Encoded by the Same Features? A Functional Dissociation via Sparse Autoencoders