cs.LG

Adaptive Scaling of Policy Constraints for Offline Reinforcement Learning

arXiv:2508.19900v2
Abstract: Offline reinforcement learning (RL) enables learning effective policies from fixed datasets without any environment interaction. Existing methods typically employ policy constraints to mitigate the d…