Conformal Policy Control
arXiv:2603.02196v2 Announce Type: replace-cross
Abstract: An agent must try new behaviors to explore and improve. In high-stakes environments, an agent that violates safety constraints may cause harm and must be taken offline, curtailing any future in…