Ming Lei, Christophe Baehr

A Comparative Theoretical Analysis of Entropy Control Methods in Reinforcement Learning

Ming Lei, Christophe Baehr / April 14, 2026

arXiv:2604.09676v1 Announce Type: new
Abstract: Reinforcement learning (RL) has become a key approach for enhancing reasoning in large language models (LLMs), yet scalable training is often hindered by the rapid collapse of policy entropy, which leads…

Author name: Ming Lei, Christophe Baehr

A Comparative Theoretical Analysis of Entropy Control Methods in Reinforcement Learning