PC3D: Zero-Shot Cooperation Across Variable Rosters via Personalized Context Distillation
arXiv:2605.10377v1 Announce Type: new
Abstract: Cooperative multi-agent reinforcement learning often assumes a fixed execution team, yet many decentralized systems must operate with varying numbers of active agents during deployment. We study this set…