Intentional Updates for Streaming Reinforcement Learning
arXiv:2604.19033v1 Announce Type: new
Abstract: In gradient-based learning, a step size chosen in parameter units does not produce a predictable per-step change in function output. This often leads to instability in the streaming setting (i.e., batch …