Model-Based Reinforcement Learning under Random Observation Delays
arXiv:2509.20869v2 Announce Type: replace
Abstract: Delays frequently occur in real-world environments, yet standard reinforcement learning (RL) algorithms often assume instantaneous perception of the environment. We study random sensor delays in POMD…