Just did an analysis on ICLR 2025 vs 2026 scores and WOW [D]

Just did an analysis on ICLR 2025 vs 2026 scores and WOW [D]

Per https://paperreview.ai/tech-overview, the scores corr between 2 human is about 0.41 for ICLR 2025, but in my current project I am seeing a much lower corr for ICLR 2026. So I ran the metrics for both 2025 and 2026 and it is crazy. I used 2 metrics, one-vs-rest corr and half-half split corr. All data are fetched from OpenReview.

I do know that top conf reviews are just a lottery now for most papers, but i nenver thought it is this bad.

2025 avg-score SD: 1.253, mean wavg-scoreer human SD: 1.186

2026 avg-score SD: 1.162, mean within-paper human SD: 1.523

https://preview.redd.it/klay6nijipug1.png?width=2090&format=png&auto=webp&s=92c85470bc72ff03584f38f160d3d09f530b55e2

  • 2025 avg-score SD: 1.253, mean within-paper human SD: 1.186
  • 2026 avg-score SD: 1.162, mean within-paper human SD: 1.523
submitted by /u/Striking-Warning9533
[link] [comments]

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top