cs.LG

MDPs with a State Sensing Cost

arXiv:2505.03280v3 Announce Type: replace
Abstract: In many practical sequential decision-making problems, tracking the state of the environment incurs a sensing/communication/computation cost. In these settings, the agent’s interaction with its envir…