Wenlong Mou - Provide.ai

cs.LG, math.OC, math.ST, stat.ML, stat.TH

Continuous-time reinforcement learning: ellipticity enables model-free value function approximation

Wenlong Mou / April 17, 2026

arXiv:2602.06930v2 Announce Type: replace-cross
Abstract: We study off-policy reinforcement learning for controlling continuous-time Markov diffusion processes with discrete-time observations and actions. We consider model-free algorithms with functio…

Author name: Wenlong Mou

Continuous-time reinforcement learning: ellipticity enables model-free value function approximation