Nearly Optimal Best Arm Identification for Semiparametric Bandits
arXiv:2604.03969v1 Announce Type: new
Abstract: We study fixed-confidence Best Arm Identification (BAI) in semiparametric bandits, where rewards are linear in arm features plus an unknown additive baseline shift. Unlike linear-bandit BAI, this setting…