cs.AI, cs.CR

Protecting the Trace: A Principled Black-Box Approach Against Distillation Attacks

arXiv:2604.23238v1 Announce Type: cross
Abstract: Frontier models push the boundaries of what is learnable, at extreme computational cost, yet distillation from sampled reasoning traces exposes closed-source frontier models to adversarial third partie…