The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning
arXiv:2603.29025v2
Abstract: Large language models systematically fail when a salient surface cue conflicts with an unstated feasibility constraint. We study this through a diagnose-measure-bridge-treat framework. Causal-behavio…