DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs
arXiv:2601.03559v2 Announce Type: replace
Abstract: Chain-of-Thought (CoT) reasoning improves multi-step mathematical problem solving in large language models but remains vulnerable to exposure bias and error accumulation, as early mistakes propagate …