cs.AI, cs.LG

G-Drift MIA: Membership Inference via Gradient-Induced Feature Drift in LLMs

arXiv:2604.00419v1 Announce Type: cross
Abstract: Large language models (LLMs) are trained on massive web-scale corpora, raising growing concerns about privacy and copyright. Membership inference attacks (MIAs) aim to determine whether a given example…