Activation Steering for Masked Diffusion Language Models
arXiv:2512.24143v3 Announce Type: replace
Abstract: Masked diffusion language models (MDLMs) generate text via iterative masked-token denoising, enabling mask-parallel decoding and distinct controllability and efficiency tradeoffs from autoregressive …