SPaCe: Unlocking Sample-Efficient Large Language Models Training With Self-Pace Curriculum Learning
arXiv:2508.05015v2 Announce Type: replace
Abstract: Large language models (LLMs) have shown strong reasoning capabilities when fine-tuned with reinforcement learning (RL). However, such methods require extensive data and compute, making them impractic…