cs.AI, cs.LG

Extending Differential Temporal Difference Methods for Episodic Problems

arXiv:2605.04368v1 Announce Type: new
Abstract: Differential temporal difference (TD) methods are value-based reinforcement learning algorithms that have been proposed for infinite-horizon problems. They rely on reward centering, where each reward is …