When Does Hierarchy Help? Benchmarking Agent Coordination in Event-Driven Industrial Scheduling
arXiv:2605.13172v1 Announce Type: cross
Abstract: Recent agent and multi-agent systems have shown strong performance on tool use, reasoning, and collaborative tasks. However, existing benchmarks mostly evaluate task completion in weakly co…