Lishui Fan, Yu Zhang, Mouxiang Chen, Zhongxin Liu

ReCode: Reinforcing Code Generation with Reasoning-Process Rewards

Lishui Fan, Yu Zhang, Mouxiang Chen, Zhongxin Liu / May 6, 2026

arXiv:2508.05170v3 Announce Type: replace-cross
Abstract: In practice, rigorous reasoning is often a key driver of correct code, while Reinforcement Learning (RL) for code generation often neglects optimizing reasoning quality. Bringing process-level …

Author name: Lishui Fan, Yu Zhang, Mouxiang Chen, Zhongxin Liu

ReCode: Reinforcing Code Generation with Reasoning-Process Rewards