Learning Preference-Based Objectives from Clinical Narratives for Sequential Treatment Decision-Making
arXiv:2604.10783v1
Abstract: Designing reward functions remains a central challenge in reinforcement learning (RL) for healthcare, where outcomes are sparse, delayed, and difficult to specify. While structured data capture physiol…