Learning from Disagreement: Clinician Overrides as Implicit Preference Signals for Clinical AI in Value-Based Care
arXiv:2604.28010v1 Announce Type: new
Abstract: We reframe clinician overrides of clinical AI recommendations as implicit preference data – the same signal structure exploited by reinforcement learning from human feedback (RLHF), but richer: the annot…