Narim Jeong, Donghwan Lee

Finite-Time Analysis of Q-Value Iteration for General-Sum Stackelberg Games

April 7, 2026

arXiv:2604.04394v1 Announce Type: new
Abstract: Reinforcement learning has been successful both empirically and theoretically in single-agent settings, but extending these results to multi-agent reinforcement learning in general-sum Markov games remai…
