Hybrid Policy Distillation for LLMs
arXiv:2604.20244v1 Announce Type: new
Abstract: Knowledge distillation (KD) is a powerful paradigm for compressing large language models (LLMs), whose effectiveness depends on intertwined choices of divergence direction, optimization strategy, and dat…
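The "divergence direction" the abstract refers to is conventionally the choice between forward KL(teacher || student), which is mode-covering, and reverse KL(student || teacher), which is mode-seeking. As a minimal illustrative sketch (not the paper's method; function names and the temperature value are illustrative), both directions for a single token's logits might look like:

```python
import math

def softmax(logits, temperature=1.0):
    # Temperature-scaled softmax over a list of logits.
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    z = sum(exps)
    return [e / z for e in exps]

def kl(p, q, eps=1e-12):
    # KL(p || q) = sum_i p_i * log(p_i / q_i); eps guards against log(0).
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))

def distillation_losses(teacher_logits, student_logits, temperature=2.0):
    """Return (forward_kl, reverse_kl) for one token position.

    Forward KL(teacher || student) spreads student mass over all teacher
    modes; reverse KL(student || teacher) concentrates it on a few.
    This choice is one of the design axes the abstract mentions.
    """
    p_t = softmax(teacher_logits, temperature)
    p_s = softmax(student_logits, temperature)
    return kl(p_t, p_s), kl(p_s, p_t)

fwd, rev = distillation_losses([2.0, 0.5, -1.0], [1.0, 1.0, 0.0])
```

In practice these losses are computed per token over the vocabulary and averaged across a sequence; the two directions coincide only when student and teacher distributions match.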