SAVGO: Learning State-Action Value Geometry with Cosine Similarity for Continuous Control
arXiv:2605.00787v1 Announce Type: new
Abstract: While representation and similarity learning have improved the sample efficiency of reinforcement learning (RL), they are rarely used to shape policy updates directly in the action space. To bridge this …