Yuanyang Li, Xue Yang, Longyue Wang, Weihua Luo, Hongyang Chen

ComplexMCP: Evaluation of LLM Agents in Dynamic, Interdependent, and Large-Scale Tool Sandbox

Yuanyang Li, Xue Yang, Longyue Wang, Weihua Luo, Hongyang Chen / May 12, 2026

arXiv:2605.10787v1 Announce Type: new
Abstract: Current LLM agents are proficient at calling isolated APIs but struggle with the “last mile” of commercial software automation. In real-world scenarios, tools are not independent; they are atomic, interd…

Author name: Yuanyang Li, Xue Yang, Longyue Wang, Weihua Luo, Hongyang Chen

ComplexMCP: Evaluation of LLM Agents in Dynamic, Interdependent, and Large-Scale Tool Sandbox