Let’s Measure Information Step-by-Step: AI-Based Evaluation Beyond Vibes
arXiv:2508.05469v3 Announce Type: replace
Abstract: We evaluate artificial intelligence (AI) systems without ground truth by exploiting a link between strategic gaming and information loss. Building on established information theory, we analyze which …