DARLING: Detection Augmented Reinforcement Learning with Non-Stationary Guarantees
arXiv:2604.16684v1 Announce Type: cross
Abstract: We study model-free reinforcement learning (RL) in non-stationary finite-horizon episodic Markov decision processes (MDPs) without prior knowledge of the non-stationarity. We focus on the piecewise-sta…