On Global Convergence Rates for Federated Softmax Policy Gradient under Heterogeneous Environments
arXiv:2505.23459v2 Announce Type: replace
Abstract: We provide global convergence rates for vanilla and entropy-regularized federated softmax stochastic policy gradient (FedPG) with local training. We show that FedPG converges to a near-optimal policy…