cs.LG

Layer Collapse in Diffusion Language Models

arXiv:2605.06366v1 Announce Type: new
Abstract: Diffusion language models (DLMs) have recently emerged as competitive alternatives to autoregressive (AR) language models, yet how their activation dynamics differ from those of AR models remains poorly understood. We charac…