Target-Aligned Reinforcement Learning
arXiv:2603.29501v1 Announce Type: new
Abstract: Many reinforcement learning algorithms rely on target networks – lagged copies of the online network – to stabilize training. While effective, this mechanism introduces a fundamental stability-recency tr…
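
For readers unfamiliar with the mechanism the abstract refers to, the following is a minimal sketch of a conventional target network with periodic hard updates. It is not the method proposed in this paper. The function name `train_with_target_network`, the parameter `sync_every`, and the single scalar weight `w` are all hypothetical, chosen only for illustration, and the "gradient update" is a stand-in increment rather than a real learning step.

```python
import copy


def train_with_target_network(num_steps, sync_every):
    """Toy loop illustrating a lagged target network (hard updates).

    Assumption: the 'online' weights change on every step, while the
    'target' weights are a copy refreshed only every `sync_every` steps.
    """
    online = {"w": 0.0}
    target = copy.deepcopy(online)  # target starts as an exact copy
    lags = []
    for step in range(1, num_steps + 1):
        online["w"] += 1.0  # stand-in for one gradient update
        if step % sync_every == 0:
            target = copy.deepcopy(online)  # periodic hard sync
        # How far the target trails the online weights after this step.
        lags.append(online["w"] - target["w"])
    return online, target, lags


online, target, lags = train_with_target_network(num_steps=5, sync_every=2)
print(lags)  # [1.0, 0.0, 1.0, 0.0, 1.0]
```

In this sketch, a larger `sync_every` keeps the target fixed for longer between syncs, but it also lets the target fall further behind the online weights.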