cs.AI, cs.LG

Soft Deterministic Policy Gradient with Gaussian Smoothing

arXiv:2605.06228v1 Announce Type: cross
Abstract: Deterministic policy gradient (DPG) is widely utilized for continuous control; however, it inherently relies on the differentiability of the critic with respect to the action during policy updates. Thi…