Janaka Chathuranga Brahmanage, Akshat Kumar

Beyond Hard Constraints: Budget-Conditioned Reachability For Safe Offline Reinforcement Learning

Janaka Chathuranga Brahmanage, Akshat Kumar / April 1, 2026

arXiv:2603.22292v2 Announce Type: replace
Abstract: Sequential decision making using Markov Decision Processes underpins many real-world applications. Both model-based and model-free methods have achieved strong results in these settings. However, real-w…
