Why Expert Alignment Is Hard: Evidence from Subjective Evaluation
arXiv:2605.04972v1 Announce Type: new
Abstract: Aligning large language models with expert judgment is especially difficult in subjective evaluation tasks, where experts may disagree, rely on tacit criteria, and change their judgments over time. In th…