Bellman Residual Minimization for Control: Geometry, Stationarity, and Convergence
arXiv:2601.18840v3 Announce Type: replace
Abstract: Markov decision problems are most commonly solved via dynamic programming. Another approach is Bellman residual minimization, which directly minimizes the squared Bellman residual objective function….