cs.AI, cs.CL

OPSDL: On-Policy Self-Distillation for Long-Context Language Models

arXiv:2604.17535v1 Announce Type: new
Abstract: Extending the effective context length of large language models (LLMs) remains a central challenge for real-world applications. While recent post-training methods have made progress in long-context scali…