Data Deletion Can Help in Adaptive RL
arXiv:2605.00298v1 Announce Type: new
Abstract: Deploying reinforcement learning policies in the real world requires adapting to time-varying environments. We study this problem in the contextual Markov Decision Process (cMDP) framework, where a famil…