cs.AI

DataDignity: Training Data Attribution for Large Language Models

arXiv:2605.05687v1 Announce Type: new
Abstract: Auditing language-model outputs often requires more than judging correctness: an auditor may need to identify which source document most likely supports the knowledge expressed in a response. We study th…