Author name: Mingyang Song, Mao Zheng

A Survey of On-Policy Distillation for Large Language Models

Mingyang Song, Mao Zheng / April 2, 2026

arXiv:2604.00626v1 Announce Type: cross
Abstract: Knowledge distillation has become a primary mechanism for transferring reasoning and domain expertise from frontier Large Language Models (LLMs) to smaller, deployable students. However, the dominant p…

cs.CL

Model Merging in the Era of Large Language Models: Methods, Applications, and Future Directions

Mingyang Song, Mao Zheng / March 31, 2026

arXiv:2603.09938v2 Announce Type: replace
Abstract: Model merging combines the parameters of multiple neural networks into a single model without additional training. As fine-tuned large language models (LLMs) proliferate, merging offers a computation…