cs.AI, cs.CL, cs.LG

SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe

arXiv:2410.05248v4 Announce Type: replace
Abstract: To acquire instruction-following capabilities, large language models (LLMs) undergo instruction tuning, where they are trained on instruction-response pairs using next-token prediction (NTP). Efforts…