Tomas Kocak, Gergely Neu, Michal Valko, Remi Munos

Efficient learning by implicit exploration in bandit problems with side observations

Tomas Kocak, Gergely Neu, Michal Valko, Remi Munos / April 28, 2026

arXiv:2604.24555v1 Announce Type: cross
Abstract: We consider online learning problems under a partial observability model capturing situations where the information conveyed to the learner is between full information and bandit feedback. In the simpl…

Author name: Tomas Kocak, Gergely Neu, Michal Valko, Remi Munos

Efficient learning by implicit exploration in bandit problems with side observations