Guneet S. Dhillon, Javier Gonz\'alez, Teodora Pandeva, Alicia Curth

E-Scores for (In)Correctness Assessment of Generative Model Outputs

Guneet S. Dhillon, Javier Gonz\'alez, Teodora Pandeva, Alicia Curth / April 2, 2026

arXiv:2510.25770v2 Announce Type: replace-cross
Abstract: While generative models, especially large language models (LLMs), are ubiquitous in today’s world, principled mechanisms to assess their (in)correctness are limited. Using the conformal predict…

Author name: Guneet S. Dhillon, Javier Gonz\'alez, Teodora Pandeva, Alicia Curth

E-Scores for (In)Correctness Assessment of Generative Model Outputs