Measuring and curing reasoning rigidity: from decorative chain-of-thought to genuine faithfulness
arXiv:2603.22816v3 Announce Type: replace
Abstract: Language models increasingly show their work by writing step-by-step reasoning before answering. But are these steps genuinely used, or is the answer rigid – fixed before reasoning begins? We introdu…