Latency and Cost of Multi-Agent Intelligent Tutoring at Scale
arXiv:2604.24110v1 Announce Type: cross
Abstract: Multi-agent LLM tutoring systems improve response quality through agent specialization, but each student query triggers several concurrent API calls whose latencies compound through a parallel-phase ma…
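To illustrate how latencies behave when several API calls are issued concurrently, here is a minimal sketch. The abstract is cut off before it finishes describing the effect, so this rests on an assumption rather than the authors' stated model: a parallel phase completes only when its slowest call returns, and consecutive phases add up. The function names, the latency values, and the simulated `fake_agent_call` are hypothetical and stand in for real agent requests.

```python
import asyncio
import time


def phase_latency(call_latencies_s):
    """Latency of one parallel phase: the slowest concurrent call (assumption)."""
    return max(call_latencies_s)


def query_latency(phases):
    """Phases are assumed to run one after another, so their latencies add up."""
    return sum(phase_latency(p) for p in phases)


async def fake_agent_call(latency_s):
    """Hypothetical stand-in for an agent's API call; sleeps instead of using a network."""
    await asyncio.sleep(latency_s)


async def timed_phase(call_latencies_s):
    """Run one phase concurrently and return the measured wall-clock time in seconds."""
    start = time.perf_counter()
    await asyncio.gather(*(fake_agent_call(s) for s in call_latencies_s))
    return time.perf_counter() - start


if __name__ == "__main__":
    phase = [0.02, 0.05, 0.03]  # illustrative per-call latencies in seconds
    print("predicted phase latency:", phase_latency(phase))
    print("measured phase latency:", asyncio.run(timed_phase(phase)))
```

Under this assumption, adding agents to a phase never shortens it: the phase is as slow as its slowest call, so a single slow call sets the latency of the whole phase.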