cs.AI, cs.LG, math.OC, stat.ML

Reinforcement Learning with Markov Risk Measures and Multipattern Risk Approximation

arXiv:2605.00654v1 Announce Type: cross
Abstract: For a risk-averse finite-horizon Markov Decision Problem, we introduce a special class of Markov coherent risk measures, called mini-batch measures. We also define the class of multipattern risk-averse…