Neural Co-state Policies: Structuring Hidden States in Recurrent Reinforcement Learning
arXiv:2605.05373v1 Announce Type: new
Abstract: A key capability of intelligent agents is operating under partial observability: reasoning and acting effectively despite missing or incomplete state observations. While recurrent (memory-based) policies…