Zijun Gao, Zhikun Xu, Xiao Ye, Ben Zhou

CORE: Concept-Oriented Reinforcement for Bridging the Definition-Application Gap in Mathematical Reasoning

Zijun Gao, Zhikun Xu, Xiao Ye, Ben Zhou / May 8, 2026

arXiv:2512.18857v3 Announce Type: replace-cross
Abstract: Large language models (LLMs) often solve challenging math exercises yet fail to apply the concept right when the problem requires genuine understanding. Popular Reinforcement Learning with Veri…

Author name: Zijun Gao, Zhikun Xu, Xiao Ye, Ben Zhou

CORE: Concept-Oriented Reinforcement for Bridging the Definition-Application Gap in Mathematical Reasoning