When Tables Leak: Attacking String Memorization in LLM-Based Tabular Data Generation
arXiv:2512.08875v2 Announce Type: replace
Abstract: Large Language Models (LLMs) have recently demonstrated remarkable performance in generating high-quality synthetic tabular data. In practice, two primary approaches have emerged for adapting LLMs to…