cs.LG, cs.RO

Fisher Decorator: Refining Flow Policy via A Local Transport Map

arXiv:2604.17919v1 Announce Type: cross
Abstract: Recent advances in flow-based offline reinforcement learning (RL) have achieved strong performance by parameterizing policies via flow matching. However, they still face critical trade-offs among expre…