cs.CL

LogitTrace: Detecting Benchmark Contamination via Layerwise Logit Trajectories

arXiv:2509.20909v2 Announce Type: replace
Abstract: Large language models (LLMs) are commonly evaluated on challenging benchmarks such as AIME and Math500, where benchmark contamination can make memorized solutions appear as genuine reasoning. Existin…