cs.LG, cs.RO

Quantile-Coupled Flow Matching for Distributional Reinforcement Learning

arXiv:2605.08515v1 Announce Type: new
Abstract: Unlike standard expected-return Reinforcement Learning (RL), Distributional RL (DRL) models the full return distribution, making it better suited for uncertainty-aware and risk-sensitive decision-making….