cs.AI, cs.LG, cs.MA

CAPO: Counterfactual Credit Assignment in Sequential Cooperative Teams

arXiv:2604.17693v1 Announce Type: new
Abstract: In cooperative teams where agents act in a fixed order and share a single team reward, it is hard to know how much each agent contributed, and harder still when agents are updated one at a time because d…