cs.LG

REALM: Reliable Expertise-Aware Language Model Fine-Tuning from Noisy Annotations

arXiv:2604.17289v1 Announce Type: new
Abstract: Supervised fine-tuning of large language models relies on human-annotated data, yet annotation pipelines routinely involve multiple crowdworkers of heterogeneous expertise. Standard practice aggregates l…