cs.CL, cs.CR

Behavioral Canaries: Auditing Private Retrieved Context Usage in RL Fine-Tuning

arXiv:2604.22191v1 Announce Type: cross
Abstract: In agentic workflows, LLMs frequently process retrieved contexts that are legally protected from further training. However, auditors currently lack a reliable way to verify if a provider has violated t…