On the Power of Adaptivity for $\varepsilon$-Best Arm Identification in Linear Bandits
arXiv:2605.15663v1 Announce Type: new
Abstract: We study the minimax sample complexity of $\varepsilon$-best arm identification in linear bandits. Given a compact action set $\mathcal{X}$ that spans $\mathbb{R}^d$ and an unknown reward vector $\theta\…