Towards Efficient and Expressive Offline RL via Flow-Anchored Noise-conditioned Q-Learning
arXiv:2605.01663v1 Announce Type: cross
Abstract: We propose Flow-Anchored Noise-conditioned Q-Learning (FAN), a highly efficient and high-performing offline reinforcement learning (RL) algorithm. Recent work has shown that expressive flow policies an…