Guiding Distribution Matching Distillation with Gradient-Based Reinforcement Learning
arXiv:2604.19009v1 Announce Type: new
Abstract: Diffusion distillation, exemplified by Distribution Matching Distillation (DMD), has shown great promise in few-step generation but often sacrifices quality for sampling speed. While integrating Reinforc…