Weakly Supervised Distillation of Hallucination Signals into Transformer Representations
arXiv:2604.06277v1 Announce Type: new
Abstract: Existing hallucination detection methods for large language models (LLMs) rely on external verification at inference time, requiring gold answers, retrieval systems, or auxiliary judge models. We ask whe…