cs.AI, cs.CY, cs.DC, cs.LG

Latency and Cost of Multi-Agent Intelligent Tutoring at Scale

arXiv:2604.24110v1 Announce Type: cross
Abstract: Multi-agent LLM tutoring systems improve response quality through agent specialization, but each student query triggers several concurrent API calls whose latencies compound through a parallel-phase ma…