Contextual Bandits for Resource-Constrained Devices using Probabilistic Learning
arXiv:2605.13346v1 Announce Type: new
Abstract: Contextual bandits (CB) are online sequential decision-making problems under partial feedback that underpin many adaptive services. There is a growing demand to deploy CB agents directly on-device, under…