A multilingual hallucination benchmark: MultiWikiQHalluA
arXiv:2605.02504v1 Announce Type: new
Abstract: Most hallucination evaluations focus on English, leaving it unclear whether findings transfer to lower-resource languages. We investigate faithfulness hallucinations, defined as model-generated content t…