cs.AI, cs.LG

Intentional Updates for Streaming Reinforcement Learning

arXiv:2604.19033v1 Announce Type: new
Abstract: In gradient-based learning, a step size chosen in parameter units does not produce a predictable per-step change in function output. This often leads to instability in the streaming setting (i.e., batch …