Hitting Time Isomorphism for Multi-Stage Planning with Foundation Policies
arXiv:2605.06470v1 Announce Type: new
Abstract: We present a new operator-theoretic representation learning framework for offline reinforcement learning that recovers the directed temporal geometry of a controlled Markov process from hitting time obse…