cs.LG, cs.SY, eess.SY, stat.ML

Cost-Ordered Feasibility for Multi-Armed Bandits with Cost Subsidy

arXiv:2605.07171v1 Announce Type: new
Abstract: The classic multi-armed bandit (MAB) problem tackles the challenge of accruing maximum reward while making decisions under uncertainty. However, in applications, often the goal is to minimize cost subjec…