Dai Do, Manh Nguyen, Svetha Venkatesh, Hung Le

SPaCe: Unlocking Sample-Efficient Large Language Models Training With Self-Pace Curriculum Learning

Dai Do, Manh Nguyen, Svetha Venkatesh, Hung Le / April 17, 2026

arXiv:2508.05015v2 Announce Type: replace
Abstract: Large language models (LLMs) have shown strong reasoning capabilities when fine-tuned with reinforcement learning (RL). However, such methods require extensive data and compute, making them impractic…

Author name: Dai Do, Manh Nguyen, Svetha Venkatesh, Hung Le

SPaCe: Unlocking Sample-Efficient Large Language Models Training With Self-Pace Curriculum Learning