cs.LG, math.OC, math.PR

Wasserstein Formulation of Reinforcement Learning. An Optimal Transport Perspective on Policy Optimization

arXiv:2604.14765v1 Announce Type: new
Abstract: We present a geometric framework for Reinforcement Learning (RL) that views policies as maps into the Wasserstein space of action probabilities. First, we define a Riemannian structure induced by station…