Learning the Model While Learning Q: Finite-Time Sample Complexity of Online SyncMBQ
arXiv:2402.11877v2 Announce Type: replace
Abstract: Reinforcement learning has witnessed significant advancements, particularly with the emergence of model-based approaches. Among these, $Q$-learning has proven to be a powerful algorithm in model-free…