Artemy Rubtsov, Sergey Samsonov, Vladimir Ulyanov, Alexey Naumov

Gaussian Approximation for Asynchronous Q-learning

Artemy Rubtsov, Sergey Samsonov, Vladimir Ulyanov, Alexey Naumov / April 9, 2026

arXiv:2604.07323v1
Abstract: In this paper, we derive rates of convergence in the high-dimensional central limit theorem for Polyak-Ruppert averaged iterates generated by the asynchronous Q-learning algorithm with a polynomial steps…
