Powerful Training-Free Membership Inference Against Autoregressive Language Models
arXiv:2601.12104v2 Announce Type: replace
Abstract: Fine-tuned language models pose significant privacy risks, as they may memorize and expose sensitive information from their training data. Membership inference attacks (MIAs) provide a principled fra…