Distribution Matching Distillation Meets Reinforcement Learning
arXiv:2511.13649v4 Announce Type: replace
Abstract: Distribution Matching Distillation (DMD) facilitates efficient inference by distilling multi-step diffusion models into few-step variants. Concurrently, Reinforcement Learning (RL) has emerged as a v…