cs.LG

Neural Co-state Policies: Structuring Hidden States in Recurrent Reinforcement Learning

arXiv:2605.05373v1 Announce Type: new
Abstract: A key capability of intelligent agents is operating under partial observability: reasoning and acting effectively despite missing or incomplete state observations. While recurrent (memory-based) policies…