Manan Gupta, Inderjeet Nair, Lu Wang, Dhruv Kumar

Context Over Content: Exposing Evaluation Faking in Automated Judges

Manan Gupta, Inderjeet Nair, Lu Wang, Dhruv Kumar / April 17, 2026

arXiv:2604.15224v1 Announce Type: cross
Abstract: The $\textit{LLM-as-a-judge}$ paradigm has become the operational backbone of automated AI evaluation pipelines, yet rests on an unverified assumption: that judges evaluate text strictly on its semanti…
