Belief-State RWKV for Reinforcement Learning under Partial Observability
arXiv:2604.09671v1 Announce Type: new
Abstract: We propose a stronger formulation of RL on top of RWKV-style recurrent sequence models, in which the fixed-size recurrent state is explicitly interpreted as a belief state rather than an opaque hidden ve…