Optimal Control of Fluid Restless Multi-armed Bandits: A Machine Learning Approach
arXiv:2502.03725v2 Announce Type: replace
Abstract: We present a novel machine learning framework for the optimal control of fluid restless multi-armed bandit problems (FRMABPs) with state equations that are either affine or quadratic in the state var…