Internal State-Based Policy Gradient Methods for Partially Observable Markov Potential Games
arXiv:2604.00433v1 (announce type: cross-list)
Abstract: This letter studies multi-agent reinforcement learning in partially observable Markov potential games. Solving this problem is challenging due to partial observability, decentralized information, and t…