cs.CL, cs.LG

C2: Scalable Rubric-Augmented Reward Modeling from Binary Preferences

arXiv:2604.13618v1 Announce Type: cross
Abstract: Rubric-augmented verification guides reward models with explicit evaluation criteria, yielding more reliable judgments than single-model verification. However, most existing methods require costly rubr…