Geometry-Aware Offline-to-Online Learning in Linear Contextual Bandits
arXiv:2604.24016v1 Announce Type: new
Abstract: We study offline-to-online learning in linear contextual bandits with biased offline regression data: the offline parameter need not match the online one, so history should not be treated as a single war…