Zero-Shot Coordination in Ad Hoc Teams with Generalized Policy Improvement and Difference Rewards
arXiv:2510.16187v2 Announce Type: replace-cross
Abstract: Real-world multi-agent systems may require ad hoc teaming, where an agent must coordinate with previously unseen teammates to solve a task in a zero-shot manner. Prior work often either s…