Factorization-Error-Free Discrete Diffusion Language Model via Speculative Decoding
arXiv:2605.14305v1 Announce Type: new
Abstract: Discrete diffusion language models improve generation efficiency through parallel token prediction, but standard $X_0$ prediction methods introduce factorization errors by approximating the clean token p…
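
The factorization error mentioned above can be illustrated with a toy example. This is a minimal sketch, assuming the error refers to replacing the joint distribution over clean tokens with a product of independent per-position distributions; the abstract is cut off at this point, so that reading is an assumption. The two-token distribution, the token strings, and all variable names are hypothetical and do not come from the paper.

```python
import math
from itertools import product

# Hypothetical joint distribution over two clean tokens (toy numbers,
# not taken from the paper): only two coherent pairs have mass.
joint = {("New", "York"): 0.5, ("Los", "Angeles"): 0.5}

first_vocab = sorted({a for a, _ in joint})
second_vocab = sorted({b for _, b in joint})

# Per-position marginals, which is all an independent per-token
# predictor can represent.
p_first = {a: sum(p for (x, _), p in joint.items() if x == a) for a in first_vocab}
p_second = {b: sum(p for (_, y), p in joint.items() if y == b) for b in second_vocab}

# Factorized approximation: product of the two marginals.
factorized = {
    (a, b): p_first[a] * p_second[b]
    for a, b in product(first_vocab, second_vocab)
}

# Probability mass the factorized model assigns to pairs that the
# joint distribution never produces (e.g. "New Angeles").
invalid_mass = sum(q for pair, q in factorized.items() if joint.get(pair, 0.0) == 0.0)

# KL(joint || factorized), in nats, as one measure of the gap.
kl = sum(p * math.log(p / factorized[pair]) for pair, p in joint.items() if p > 0)

print(f"mass on incoherent pairs: {invalid_mass:.2f}")
print(f"KL(joint || factorized): {kl:.4f} nats")
```

With these toy numbers, each marginal is uniform, so the product spreads probability evenly over all four pairs: half of the mass falls on pairs the joint never produces, and the KL divergence equals $\log 2$. Sampling both positions in parallel from the marginals would therefore yield an incoherent pair half of the time, even though each marginal is exactly correct.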