cs.LG, stat.AP

dFlowGRPO: Rate-Aware Policy Optimization for Discrete Flow Models

arXiv:2605.09291v1 Announce Type: new
Abstract: Discrete flow models (DFMs) are a flexible class of generative models for discrete data, and diffusion large language models (dLLMs) can be viewed as a special case with a specific choice of m…