On the Importance of Multistability for Horizon Generalization in Reinforcement Learning
arXiv:2605.12206v1 Announce Type: new
Abstract: In reinforcement learning (RL), agents acting in partially observable Markov decision processes (POMDPs) must rely on memory, typically encoded in a recurrent neural network (RNN), to integrate informati…