Worst-Case Regret Bounds for Combinatorial Thompson Sampling in Sleeping Semi-Bandits
arXiv:2605.09277v2 Announce Type: replace
Abstract: We revisit combinatorial Thompson sampling (CTS) for semi-bandits with sleeping arms, where arm availability varies over time and actions must satisfy combinatorial constraints, as in wireless mesh r…
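To make the setting concrete, the following is a minimal sketch of one combinatorial Thompson sampling loop with sleeping arms. It is an illustration under stated assumptions, not the algorithm or model analyzed in the paper: rewards are assumed Bernoulli with Beta(1, 1) priors, each arm is assumed to be available independently with probability `avail_prob` in every round, and the combinatorial constraint is assumed to be a simple cardinality bound (choose at most `m` available arms). The names `cts_sleeping`, `means`, `m`, and `avail_prob` are hypothetical.

```python
import random


def cts_sleeping(means, m, horizon, avail_prob=0.7, seed=0):
    """Sketch of combinatorial Thompson sampling with sleeping arms.

    Assumptions (not taken from the paper): Bernoulli rewards,
    Beta(1, 1) priors, i.i.d. arm availability, and a cardinality
    constraint of at most m arms per round.
    """
    rng = random.Random(seed)
    k = len(means)
    alpha = [1.0] * k  # posterior successes + 1 for each arm
    beta = [1.0] * k   # posterior failures + 1 for each arm
    pulls = [0] * k
    total_reward = 0.0

    for _ in range(horizon):
        # Sleeping arms: only a random subset is available this round.
        available = [i for i in range(k) if rng.random() < avail_prob]
        if not available:
            continue  # nothing can be played this round

        # Draw one posterior sample per available arm.
        theta = {i: rng.betavariate(alpha[i], beta[i]) for i in available}

        # Oracle for the assumed constraint: the m available arms
        # with the largest sampled means.
        action = sorted(available, key=theta.get, reverse=True)[:m]

        # Semi-bandit feedback: the reward of every chosen arm is observed.
        for i in action:
            reward = 1 if rng.random() < means[i] else 0
            alpha[i] += reward
            beta[i] += 1 - reward
            pulls[i] += 1
            total_reward += reward

    return total_reward, pulls
```

For example, `cts_sleeping([0.9, 0.8, 0.2, 0.1], m=2, horizon=2000)` returns the cumulative reward and the per-arm pull counts. A richer constraint, such as paths in a routing graph, would replace the top-`m` selection with a problem-specific optimization oracle restricted to the available arms.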