Incentivizing High-Quality Human Annotations with Golden Questions
arXiv:2505.19134v2
Abstract: Human-annotated data plays a vital role in training large language models (LLMs), for example in supervised fine-tuning and human preference alignment. However, it is not guaranteed that paid human an…