Learning Decentralized LLM Collaboration with Multi-Agent Actor Critic
arXiv:2601.21972v4 Announce Type: replace
Abstract: Recent work has explored optimizing LLM collaboration through Multi-Agent Reinforcement Learning (MARL). However, most MARL fine-tuning approaches rely on predefined execution protocols, which often …