cs.AI, cs.LG

Revisiting Mixture Policies in Entropy-Regularized Actor-Critic

arXiv:2605.09157v1 Announce Type: new
Abstract: Mixture policies theoretically offer greater flexibility than unimodal policies in continuous action reinforcement learning, but the practical benefits of this complexity remain elusive. Mixture policies…