cs.AI, cs.LG

Dissecting Discrete Soft Actor-Critic: Limitations and Principled Alternatives

arXiv:2509.09838v2 Announce Type: replace
Abstract: While Soft Actor-Critic (SAC) is highly effective in continuous control, its discrete counterpart (DSAC) performs poorly on challenging discrete-action domains such as Atari. Consequently, starting f…