cs.LG

Model-Based Learning of Near-Optimal Finite-Window Policies in POMDPs

arXiv:2604.01024v1 Announce Type: new
Abstract: We study model-based learning of finite-window policies in tabular partially observable Markov decision processes (POMDPs). A common approach to learning under partial observability is to approximate unb…