Lars van der Laan, Nathan Kallus

Fitted Q Evaluation Without Bellman Completeness via Stationary Weighting

Lars van der Laan, Nathan Kallus / April 22, 2026

arXiv:2512.23805v2 Announce Type: replace-cross
Abstract: Fitted Q-evaluation (FQE) is a foundational method for off-policy evaluation in reinforcement learning, but existing theory typically relies on Bellman completeness of the function class, a con…

Author name: Lars van der Laan, Nathan Kallus

Fitted Q Evaluation Without Bellman Completeness via Stationary Weighting