AgentCollabBench: Diagnosing When Good Agents Make Bad Collaborators
arXiv:2605.08647v1 Announce Type: cross
Abstract: Multi-agent systems achieve state-of-the-art outcomes through peer collaboration. However, when an agent in the pipeline silently drops a constraint, the system’s final output may look correct even tho…