cs.AI, cs.CL

Infinite Mask Diffusion for Few-Step Distillation

arXiv:2605.10518v1 Announce Type: cross
Abstract: Masked Diffusion Models (MDMs) have emerged as a promising alternative to autoregressive models in language modeling, offering the advantages of parallel decoding and bidirectional context processing w…