Drifting Field Policy: A One-Step Generative Policy via Wasserstein Gradient Flow
arXiv:2605.07727v1 Announce Type: cross
Abstract: We propose Drifting Field Policy (DFP), a non-ODE one-step generative policy built on the drifting model paradigm. We frame the policy update as a reverse-KL Wasserstein-2 gradient flow toward a soft t…