cs.LG, stat.ML

Adaptive Estimation and Optimal Control in Offline Contextual MDPs without Stationarity

arXiv:2605.03393v1 Announce Type: cross
Abstract: Contextual MDPs are powerful tools with wide applicability in areas from biostatistics to machine learning. However, specializing them to offline datasets has been challenging due to a lack of robust, …