Contrastive Decoding Mitigates Score Range Bias in LLM-as-a-Judge
arXiv:2510.18196v2 Announce Type: replace-cross
Abstract: Large Language Models (LLMs) are commonly used as evaluators in various applications, but the reliability of the outcomes remains a challenge. One such challenge is using LLMs-as-judges for dir…