cs.LG, stat.ML

Covariance-adapting algorithm for semi-bandits with application to sparse rewards

arXiv:2604.13738v1 Announce Type: cross
Abstract: We investigate stochastic combinatorial semi-bandits, where the entire joint distribution of outcomes impacts the complexity of the problem instance (unlike in standard bandits). Typical distributi…