Wonseok Yang, Thinh T. Doan

Internal State-Based Policy Gradient Methods for Partially Observable Markov Potential Games

Wonseok Yang, Thinh T. Doan / April 2, 2026

arXiv:2604.00433v1 Announce Type: cross
Abstract: This letter studies multi-agent reinforcement learning in partially observable Markov potential games. Solving this problem is challenging due to partial observability, decentralized information, and t…

Author name: Wonseok Yang, Thinh T. Doan

Internal State-Based Policy Gradient Methods for Partially Observable Markov Potential Games