Author name: Harin Lee, Min-hwan Oh

Unified Framework of Distributional Regret in Multi-Armed Bandits and Reinforcement Learning

Harin Lee, Min-hwan Oh / May 8, 2026

arXiv:2605.05102v2 Announce Type: replace-cross
Abstract: We study the distribution of regret in stochastic multi-armed bandits and episodic reinforcement learning through a unified framework. We formalize a distributional regret bound as a probabilis…

cs.LG, stat.ML

Unified Framework of Distributional Regret in Multi-Armed Bandits and Reinforcement Learning

Harin Lee, Min-hwan Oh / May 7, 2026

arXiv:2605.05102v1 Announce Type: cross
Abstract: We study the distribution of regret in stochastic multi-armed bandits and episodic reinforcement learning through a unified framework. We formalize a distributional regret bound as a probabilistic guar…