Evaluating Latent Knowledge of Public Tabular Datasets in Large Language Models
arXiv:2510.20351v2 Announce Type: replace
Abstract: Large language models (LLMs) are increasingly exposed to data contamination, i.e., performance gains driven by prior exposure of test datasets rather than generalization. However, in the context of t…