cs.LG, math.ST, stat.ML, stat.TH

Policy Testing in Markov Decision Processes

arXiv:2505.15342v2 Announce Type: replace
Abstract: We study the policy testing problem in discounted Markov decision processes (MDPs) in the fixed-confidence setting under a generative model with static sampling. The goal is to decide whether the val…