Alexander Tyurin, Andrei Spiridonov, Varvara Rudenko

Asynchronous Policy Gradient Aggregation for Efficient Distributed Reinforcement Learning

Alexander Tyurin, Andrei Spiridonov, Varvara Rudenko / March 31, 2026

arXiv:2509.24305v2 Announce Type: replace
Abstract: We study distributed reinforcement learning (RL) with policy gradient methods under asynchronous and parallel computations and communications. While non-distributed methods are well understood theore…
