Design Experiments to Compare Multi-armed Bandit Algorithms
arXiv:2603.05919v2
Abstract: Online platforms routinely compare multi-armed bandit algorithms, such as UCB and Thompson Sampling, to select the best-performing policy. Unlike standard A/B tests for static treatments, each run of…
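To make the setting concrete, here is a minimal sketch, not the paper's method, of the kind of comparison the abstract describes: UCB1 and Thompson Sampling run on a simulated Bernoulli bandit, with final cumulative regret averaged over repeated runs. The arm means, horizon, and number of replications are illustrative assumptions; the averaging over independent runs reflects the run-to-run randomness that, per the abstract, distinguishes bandit comparisons from standard A/B tests of static treatments.

```python
import numpy as np

rng = np.random.default_rng(0)

def run_ucb1(means, horizon):
    """UCB1 on a Bernoulli bandit; returns cumulative regret per step."""
    k = len(means)
    counts = np.zeros(k)        # pulls per arm
    sums = np.zeros(k)          # total reward per arm
    regret = np.zeros(horizon)
    best = max(means)
    for t in range(horizon):
        if t < k:
            arm = t             # play each arm once to initialize
        else:
            ucb = sums / counts + np.sqrt(2 * np.log(t + 1) / counts)
            arm = int(np.argmax(ucb))
        reward = rng.random() < means[arm]   # Bernoulli reward
        counts[arm] += 1
        sums[arm] += reward
        regret[t] = best - means[arm]        # expected per-step regret
    return np.cumsum(regret)

def run_thompson(means, horizon):
    """Thompson Sampling with Beta(1,1) priors; returns cumulative regret."""
    k = len(means)
    alpha = np.ones(k)
    beta = np.ones(k)
    regret = np.zeros(horizon)
    best = max(means)
    for t in range(horizon):
        arm = int(np.argmax(rng.beta(alpha, beta)))  # posterior sample per arm
        reward = rng.random() < means[arm]
        alpha[arm] += reward
        beta[arm] += 1 - reward
        regret[t] = best - means[arm]
    return np.cumsum(regret)

# Hypothetical experiment: 3 arms, 5000 steps, 50 independent replications.
means = [0.5, 0.55, 0.6]
horizon, n_runs = 5000, 50
ucb = np.mean([run_ucb1(means, horizon)[-1] for _ in range(n_runs)])
ts = np.mean([run_thompson(means, horizon)[-1] for _ in range(n_runs)])
print(f"mean final regret  UCB1: {ucb:.1f}  Thompson: {ts:.1f}")
```

Because each run is a stochastic trajectory, a single realization of either algorithm can be misleading; the replication loop above is the naive remedy, and designing such comparisons more carefully is the question the paper takes up.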