TimeSeek: Temporal Reliability of Agentic Forecasters
arXiv:2604.04220v1 Announce Type: new
Abstract: We introduce TimeSeek, a benchmark for studying how the reliability of agentic LLM forecasters changes over a prediction market’s lifecycle. We evaluate 10 frontier models on 150 CFTC-regulated Kalshi bi…