cs.AI, cs.CL, cs.LG, cs.SD

EchoChain: A Full-Duplex Benchmark for State-Update Reasoning Under Interruptions

arXiv:2604.16456v1 Announce Type: new
Abstract: Real-time voice assistants must revise task state when users interrupt mid-response, but existing spoken-dialog benchmarks largely evaluate turn-based interaction and miss this failure mode. We introduce…