Policy Testing in Markov Decision Processes
arXiv:2505.15342v2
Abstract: We study the policy testing problem in discounted Markov decision processes (MDPs) in the fixed-confidence setting under a generative model with static sampling. The goal is to decide whether the val…