On the Divergence of Differential Temporal Difference Learning without Local Clocks
arXiv:2605.06874v1 Announce Type: new
Abstract: Learning rate is a critical component of reinforcement learning (RL). This work uses global and local clocks to distinguish two types of learning rates. The former is of the standard form $\alpha_t$ that…