CAMEL: Confidence-Gated Reflection for Reward Modeling
arXiv:2602.20670v2 Announce Type: replace
Abstract: Reward models play a fundamental role in aligning large language models with human preferences. Existing methods predominantly follow two paradigms: scalar discriminative preference models, which are…