Quantifying and Mitigating Self-Preference Bias of LLM Judges
arXiv:2604.22891v2
Abstract: LLM-as-a-Judge has become a dominant approach in automated evaluation systems, playing critical roles in model alignment, leaderboard construction, and quality control, among other applications. However, the scal…