Hidden Failures in Robustness: Why Supervised Uncertainty Quantification Needs Better Evaluation
arXiv:2604.11662v1 Announce Type: new
Abstract: Recent work has shown that the hidden states of large language models contain signals useful for uncertainty estimation and hallucination detection, motivating a growing interest in efficient probe-based…