cs.CL

Universal YOCO for Efficient Depth Scaling

arXiv:2604.01220v1 Announce Type: new
Abstract: The rise of test-time scaling has remarkably boosted the reasoning and agentic proficiency of Large Language Models (LLMs). Yet, standard Transformers struggle to scale inference-time compute efficiently…