cs.LG, stat.ML

Discrete Tilt Matching

arXiv:2604.18739v1 Announce Type: new
Abstract: Masked diffusion large language models (dLLMs) are a promising alternative to autoregressive generation. While reinforcement learning (RL) methods have recently been adapted to dLLM fine-tuning, their ob…