Evolutionary Task Discovery: Advancing Reasoning Frontiers via Skill Composition and Complexity Scaling
arXiv:2605.11666v1 Announce Type: new
Abstract: The reasoning frontier of Large Language Models (LLMs) has advanced significantly through modern post-training paradigms (e.g., Reinforcement Learning from Verifiable Rewards (RLVR)). However, the effica…