Stavros Orfanoudakis, Pedro P. Vergara

SAVGO: Learning State-Action Value Geometry with Cosine Similarity for Continuous Control

May 4, 2026

arXiv:2605.00787v1
Abstract: While representation and similarity learning have improved the sample efficiency of Reinforcement Learning (RL), they are rarely used to shape policy updates directly in the action space. To bridge this …
