cs.LG, stat.ML

Efficient learning by implicit exploration in bandit problems with side observations

arXiv:2604.24555v1 Announce Type: cross
Abstract: We consider online learning problems under a partial observability model capturing situations where the information conveyed to the learner lies between full-information feedback and bandit feedback. In the simpl…