Fitted Q Evaluation Without Bellman Completeness via Stationary Weighting
arXiv:2512.23805v2
Abstract: Fitted Q-evaluation (FQE) is a foundational method for off-policy evaluation in reinforcement learning, but existing theory typically relies on Bellman completeness of the function class, a con…