Maris F. L. Galesloot, Thomas Rhemrev, Nils Jansen

Robust Probabilistic Shielding for Safe Offline Reinforcement Learning

Maris F. L. Galesloot, Thomas Rhemrev, Nils Jansen / May 12, 2026

arXiv:2605.10293v1 Announce Type: cross
Abstract: In offline reinforcement learning (RL), we learn policies from fixed datasets without environment interaction. The major challenges are to provide guarantees on the (1) performance and (2) safety of th…

Author name: Maris F. L. Galesloot, Thomas Rhemrev, Nils Jansen

Robust Probabilistic Shielding for Safe Offline Reinforcement Learning