Julien Seznec, Pierre M\'enard, Alessandro Lazaric, Michal Valko

A single algorithm for both restless and rested rotting bandits

Julien Seznec, Pierre M\'enard, Alessandro Lazaric, Michal Valko / April 24, 2026

arXiv:2604.21432v1 Announce Type: new
Abstract: In many application domains (e.g., recommender systems, intelligent tutoring systems), the rewards associated to the actions tend to decrease over time. This decay is either caused by the actions execute…

Author name: Julien Seznec, Pierre M\'enard, Alessandro Lazaric, Michal Valko

A single algorithm for both restless and rested rotting bandits