Joshua Ward, Bochao Gu, Chi-Hua Wang, Guang Cheng

When Tables Leak: Attacking String Memorization in LLM-Based Tabular Data Generation

Joshua Ward, Bochao Gu, Chi-Hua Wang, Guang Cheng / May 12, 2026

arXiv:2512.08875v2 Announce Type: replace
Abstract: Large Language Models (LLMs) have recently demonstrated remarkable performance in generating high-quality tabular synthetic data. In practice, two primary approaches have emerged for adapting LLMs to…

Author name: Joshua Ward, Bochao Gu, Chi-Hua Wang, Guang Cheng

When Tables Leak: Attacking String Memorization in LLM-Based Tabular Data Generation