Author name: Abhishek Gupta, Aditya Mahajan

Operator-Theoretic Foundations and Policy Gradient Methods for General MDPs with Unbounded Costs

Abhishek Gupta, Aditya Mahajan / April 1, 2026

arXiv:2603.17875v3 Announce Type: replace
Abstract: Markov decision processes (MDPs) are viewed as an optimization of an objective function over certain linear operators on general function spaces. A new existence result is established for the existe…

cs.LG, math.OC

Operator-Theoretic Foundations and Policy Gradient Methods for General MDPs with Unbounded Costs

Abhishek Gupta, Aditya Mahajan / March 26, 2026

arXiv:2603.17875v2 Announce Type: replace
Abstract: Markov decision processes (MDPs) are viewed as an optimization of an objective function over certain linear operators on general function spaces. A new existence result is established for the existe…