Leonardo Bertolazzi, Sandro Pezzelle, Raffaella Bernardi

How Language Models Conflate Logical Validity with Plausibility: A Representational Analysis of Content Effects

Leonardo Bertolazzi, Sandro Pezzelle, Raffaella Bernardi / April 21, 2026

arXiv:2510.06700v3 Announce Type: replace
Abstract: Both humans and large language models (LLMs) exhibit content effects: biases in which the plausibility of the semantic content of a reasoning problem influences judgments regarding its logical validi…

Author name: Leonardo Bertolazzi, Sandro Pezzelle, Raffaella Bernardi

How Language Models Conflate Logical Validity with Plausibility: A Representational Analysis of Content Effects