Foresight Arena: An On-Chain Benchmark for Evaluating AI Forecasting Agents
arXiv:2605.00420v2 Announce Type: replace-cross
Abstract: Evaluating the true forecasting ability of AI agents requires environments that are resistant to environments resistant to overfitting, free from centralized trust, and grounded in incentive-co…