FACT-E: Causality-Inspired Evaluation for Trustworthy Chain-of-Thought Reasoning
arXiv:2604.10693v1 Announce Type: new
Abstract: Chain-of-Thought (CoT) prompting has improved LLM reasoning, but models often generate explanations that appear coherent while containing unfaithful intermediate steps. Existing self-evaluation approache…