cs.LG, cs.SY, eess.SY

Finite-Time Analysis of Q-Value Iteration for General-Sum Stackelberg Games

arXiv:2604.04394v1 Announce Type: new
Abstract: Reinforcement learning has been successful both empirically and theoretically in single-agent settings, but extending these results to multi-agent reinforcement learning in general-sum Markov games remai…