Author name: William Walden, Miriam Wanner

Reasoning Models Will Sometimes Lie About Their Reasoning

William Walden, Miriam Wanner / April 22, 2026

arXiv:2601.07663v4 Announce Type: replace
Abstract: Hint-based faithfulness evaluations have established that Large Reasoning Models (LRMs) may not say what they think: they do not always volunteer information about how key parts of the input (e.g. an…

cs.AI, cs.CL

Reasoning Models Will Sometimes Lie About Their Reasoning

William Walden, Miriam Wanner / April 13, 2026

arXiv:2601.07663v3 Announce Type: replace-cross
Abstract: Hint-based faithfulness evaluations have established that Large Reasoning Models (LRMs) may not say what they think: they do not always volunteer information about how key parts of the input (e…