EchoChain: A Full-Duplex Benchmark for State-Update Reasoning Under Interruptions
arXiv:2604.16456v1 Announce Type: new
Abstract: Real-time voice assistants must revise task state when users interrupt mid-response, but existing spoken-dialog benchmarks largely evaluate turn-based interaction and miss this failure mode. We introduce…