cs.CL

How Language Models Conflate Logical Validity with Plausibility: A Representational Analysis of Content Effects

arXiv:2510.06700v3 Announce Type: replace
Abstract: Both humans and large language models (LLMs) exhibit content effects: biases in which the plausibility of the semantic content of a reasoning problem influences judgments regarding its logical validi…