CLewR: Curriculum Learning with Restarts for Machine Translation Preference Learning
arXiv:2601.05858v2
Abstract: Large language models (LLMs) have demonstrated competitive performance in zero-shot multilingual machine translation (MT). Follow-up work has further improved MT performance via preference op…