Patrick Vossler, Fan Xia, Yifan Mai, Adarsh Subbaswamy, Jean Feng

LLMs Judging LLMs: A Simplex Perspective

April 7, 2026

arXiv:2505.21972v3
Abstract: Given the challenge of automatically evaluating free-form outputs from large language models (LLMs), an increasingly common solution is to use LLMs themselves as the judging mechanism, without …
