cs.AI, cs.CL

Evaluating Latent Knowledge of Public Tabular Datasets in Large Language Models

arXiv:2510.20351v2 Announce Type: replace
Abstract: Large language models (LLMs) are increasingly exposed to data contamination, i.e., performance gains driven by prior exposure of test datasets rather than generalization. However, in the context of t…