cs.AI, cs.LG

A Comparative Theoretical Analysis of Entropy Control Methods in Reinforcement Learning

arXiv:2604.09676v1 Announce Type: new
Abstract: Reinforcement learning (RL) has become a key approach for enhancing reasoning in large language models (LLMs), yet scalable training is often hindered by the rapid collapse of policy entropy, which leads…