Minzhu Tu, Shiyu Ni, Keping Bi

How Long Reasoning Chains Influence LLMs’ Judgment of Answer Factuality

Minzhu Tu, Shiyu Ni, Keping Bi / April 9, 2026

arXiv:2604.06756v1 Announce Type: new
Abstract: Large language models (LLMs) has been widely adopted as a scalable surrogate for human evaluation, yet such judges remain imperfect and susceptible to surface-level biases. One possible reason is that th…

Author name: Minzhu Tu, Shiyu Ni, Keping Bi

How Long Reasoning Chains Influence LLMs’ Judgment of Answer Factuality