cs.CL

SEQUOR: A Multi-Turn Benchmark for Realistic Constraint Following

arXiv:2605.06353v1 Announce Type: new
Abstract: In a conversation, a helpful assistant must reliably follow user directives, even as they refine, modify, or contradict earlier requests. Yet most instruction-following benchmarks focus on single-turn or…