Dimitris Bertsimas, Cheol Woo Kim, José Niño-Mora

Optimal Control of Fluid Restless Multi-armed Bandits: A Machine Learning Approach

Dimitris Bertsimas, Cheol Woo Kim, José Niño-Mora / May 8, 2026

arXiv:2502.03725v2 Announce Type: replace
Abstract: We present a novel machine learning framework for the optimal control of fluid restless multi-armed bandit problems (FRMABPs) with state equations that are either affine or quadratic in the state var…
