Cost-Ordered Feasibility for Multi-Armed Bandits with Cost Subsidy
arXiv:2605.07171v1 Announce Type: new
Abstract: The classic multi-armed bandit (MAB) problem addresses the challenge of accruing maximum reward while making decisions under uncertainty. In many applications, however, the goal is to minimize cost subjec…