Jeremy Qin, Maksym Andriushchenko

QuantSightBench: Evaluating LLM Quantitative Forecasting with Prediction Intervals

April 20, 2026

arXiv:2604.15859v1 Announce Type: cross
Abstract: Forecasting has become a natural benchmark for reasoning under uncertainty. Yet existing evaluations of large language models remain limited to judgmental tasks in simple formats, such as binary or mul…
