Benchmarking Real-Time Question Answering via Executable Code Workflows
arXiv:2604.16349v1 Announce Type: cross
Abstract: Retrieving real-time information is a fundamental capability for search-integrated agents in real-world applications. However, existing benchmarks are predominantly static and therefore fail to capture…