cs.CL, cs.LG

LongAct: Harnessing Intrinsic Activation Patterns for Long-Context Reinforcement Learning

arXiv:2604.14922v1 Announce Type: new
Abstract: Reinforcement Learning (RL) has emerged as a critical driver for enhancing the reasoning capabilities of Large Language Models (LLMs). While recent advancements have focused on reward engineering or data…