Minghe Shen, Ananth Balashankar, Adam Fisch, David Madras, Miguel Rodrigues

Robust LLM Performance Certification via Constrained Maximum Likelihood Estimation

Minghe Shen, Ananth Balashankar, Adam Fisch, David Madras, Miguel Rodrigues / April 7, 2026

arXiv:2604.03257v1 Announce Type: new
Abstract: The ability to rigorously estimate the failure rates of large language models (LLMs) is a prerequisite for their safe deployment. Currently, however, practitioners often face a tradeoff between expensive…
