Shengfan Cao, Francesco Borrelli, Eunhyek Joa

Constrained Policy Optimization via Sampling-Based Weight-Space Projection

Shengfan Cao, Francesco Borrelli, Eunhyek Joa / May 19, 2026

arXiv:2512.13788v2 Announce Type: replace-cross
Abstract: Safety-critical learning requires policies that improve performance without leaving the safe operating regime. We study constrained policy learning where model parameters must satisfy rollout-b…

Author name: Shengfan Cao, Francesco Borrelli, Eunhyek Joa

Constrained Policy Optimization via Sampling-Based Weight-Space Projection