cs.CR, cs.CV

CoLA: A Choice Leakage Attack Framework to Expose Privacy Risks in Subset Training

arXiv:2604.12342v1 Announce Type: cross
Abstract: Training models on a carefully chosen portion of data rather than the full dataset is now a standard preprocess for modern ML. From vision coreset selection to large-scale filtering in language models,…