Does RL Expand the Capability Boundary of LLM Agents? A PASS@(k,T) Analysis
arXiv:2604.14877v1 Announce Type: new
Abstract: Does reinforcement learning genuinely expand what LLM agents can do, or merely make them more reliable? For static reasoning, recent work points to the latter: base and RL pass@k curves converge at large k…
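
As a reading aid for the pass@k convergence claim, here is a minimal sketch of the widely used unbiased pass@k estimator, 1 - C(n-c, k) / C(n, k), computed from n samples of which c are correct. It is not taken from this paper and does not implement PASS@(k,T), which the abstract excerpt does not define. The function names and the per-task success counts are hypothetical, chosen only to show how two models with very different pass@1 can reach the same pass@k once k is large.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate: probability that at least one of k samples,
    drawn without replacement from n samples containing c correct ones, is correct."""
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    if n - c < k:
        # Fewer than k incorrect samples exist, so any k-subset contains a correct one.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(counts: list[int], n: int, k: int) -> float:
    """Average pass@k over tasks, given each task's number of correct samples."""
    return sum(pass_at_k(n, c, k) for c in counts) / len(counts)


# Hypothetical data (NOT from the paper): correct samples out of n = 100 per task.
N_SAMPLES = 100
base_counts = [2, 0, 10]   # assumed base model: rarely correct, but solves tasks 1 and 3
rl_counts = [40, 0, 60]    # assumed RL model: far more reliable on the same two tasks

for k in (1, 10, 100):
    base = mean_pass_at_k(base_counts, N_SAMPLES, k)
    rl = mean_pass_at_k(rl_counts, N_SAMPLES, k)
    print(f"k={k:>3}  base={base:.3f}  rl={rl:.3f}")
```

In this toy setup the RL model is far ahead at k = 1, yet both models solve the same two of three tasks, so their averages coincide at k = 100. That is the pattern the abstract describes as the curves converging: higher reliability without a larger set of solvable tasks.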