Asynchronous Policy Gradient Aggregation for Efficient Distributed Reinforcement Learning
arXiv:2509.24305v2 Announce Type: replace
Abstract: We study distributed reinforcement learning (RL) with policy gradient methods under asynchronous and parallel computations and communications. While non-distributed methods are well understood theore…