Han-Dong Lim, HyeAnn Lee, Donghwan Lee

Learning the Model While Learning Q: Finite-Time Sample Complexity of Online SyncMBQ

Han-Dong Lim, HyeAnn Lee, Donghwan Lee / March 31, 2026

arXiv:2402.11877v2 Announce Type: replace
Abstract: Reinforcement learning has witnessed significant advancements, particularly with the emergence of model-based approaches. Among these, $Q$-learning has proven to be a powerful algorithm in model-free…

Author name: Han-Dong Lim, HyeAnn Lee, Donghwan Lee

Learning the Model While Learning Q: Finite-Time Sample Complexity of Online SyncMBQ