Seungyub Han, Hyungjin Kim, Jungwoo Lee

Lyapunov-Guided Self-Alignment: Test-Time Adaptation for Offline Safe Reinforcement Learning

Seungyub Han, Hyungjin Kim, Jungwoo Lee / April 30, 2026

arXiv:2604.26516v1 Announce Type: cross
Abstract: Offline reinforcement learning (RL) agents often fail when deployed, as the gap between training datasets and real environments leads to unsafe behavior. To address this, we present SAS (Self-Alignment…
