Ege C. Kaya, Abolfazl Hashemi

A Finite-Iteration Theory for Asynchronous Categorical Distributional Temporal-Difference Learning

May 11, 2026

arXiv:2605.06866v1 Announce Type: new
Abstract: Recent non-asymptotic analyses have substantially advanced the theory of distributional policy evaluation, but they largely concern synchronous full-state updates under a generative model, model-based es…
