Robust Parameter Learning for Uncertain MDPs
arXiv:2605.01339v1 Announce Type: new
Abstract: Learning-based approaches to verifying unknown Markov decision processes (MDPs) often employ uncertain MDPs. These models use, for example, confidence intervals to capture transition uncertainty and allo…