cs.LG

Language Modeling with Hyperspherical Flows

arXiv:2605.11125v1 Announce Type: new
Abstract: Discrete Diffusion Language Models progressed rapidly as an alternative to autoregressive (AR) models, motivated by their parallel generation abilities. However, for tractability, discrete diffusion mode…