Epistemic Robust Offline Reinforcement Learning
arXiv:2604.07072v1 Announce Type: new
Abstract: Offline reinforcement learning learns policies from fixed datasets without further environment interaction. A key challenge in this setting is epistemic uncertainty, arising from limited or biased data c…