cs.CV

DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models

arXiv:2512.15713v3 Announce Type: replace
Abstract: Diffusion-based decoding has recently emerged as an appealing alternative to autoregressive (AR) generation, offering the potential to update multiple tokens in parallel and reduce latency. However, …