Infinite Mask Diffusion for Few-Step Distillation
arXiv:2605.10518v1 Announce Type: cross
Abstract: Masked Diffusion Models (MDMs) have emerged as a promising alternative to autoregressive models in language modeling, offering the advantages of parallel decoding and bidirectional context processing w…