cs.AI, cs.LG

GeoRA: Geometry-Aware Low-Rank Adaptation for RLVR

arXiv:2601.09361v3 Announce Type: replace-cross
Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) is a key paradigm for improving large-scale reasoning models. Unlike supervised fine-tuning (SFT), RLVR exhibits distinct optimization dyna…