cs.AI, cs.LG, cs.RO, cs.SY, eess.SY, math.OC

Self-Organizing Dual-Buffer Adaptive Clustering Experience Replay (SODACER) for Safe Reinforcement Learning in Optimal Control

arXiv:2601.06540v2 Announce Type: replace-cross
Abstract: This paper proposes a novel reinforcement learning framework, named Self-Organizing Dual-buffer Adaptive Clustering Experience Replay (SODACER), designed to achieve safe and scalable optimal co…