CyclicJudge: Mitigating Judge Bias Efficiently in LLM-based Evaluation
arXiv:2603.01865v3 Announce Type: replace
Abstract: LLM-as-judge evaluation has become standard practice for open-ended model assessment; however, judges exhibit systematic biases that cannot be averaged out by increasing the number of scenarios or ge…