cs.AI

BenchScope: How Many Independent Signals Does Your Benchmark Provide?

arXiv:2603.29357v1 Announce Type: new
Abstract: AI evaluation suites often report many scores without checking whether those scores carry independent information. We introduce Effective Dimensionality (ED), the participation ratio of a centered benchm…