cs.LG

LLM-as-Judge on a Budget

arXiv:2602.15481v2 Announce Type: replace
Abstract: LLM-as-a-judge has emerged as a cornerstone technique for evaluating large language models by leveraging LLM reasoning to score prompt-response pairs. Since LLM judgments are stochastic, practitioner…