Fisher Decorator: Refining Flow Policy via A Local Transport Map
arXiv:2604.17919v1 Announce Type: cross
Abstract: Recent advances in flow-based offline reinforcement learning (RL) have achieved strong performance by parameterizing policies via flow matching. However, they still face critical trade-offs among expre…