From Instance Selection to Fixed-Pool Data Recipe Search for Supervised Fine-Tuning
arXiv:2605.12944v1 Announce Type: new
Abstract: Supervised fine-tuning (SFT) data selection is commonly formulated as instance ranking: score each example and retain a top-$k$ subset. However, effective SFT training subsets are often produced through …
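
As a minimal sketch of the instance-ranking formulation the abstract starts from (score each example, retain a top-$k$ subset), the Python below ranks a pool by a scoring function and keeps the $k$ highest-scoring examples. The names `select_top_k` and `score_fn`, and the length-based score in the usage lines, are hypothetical placeholders for illustration; they are not taken from the paper.

```python
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")


def select_top_k(examples: Sequence[T], score_fn: Callable[[T], float], k: int) -> List[T]:
    """Return the k examples with the highest scores, best first.

    score_fn is an assumed, user-supplied quality score; ties keep
    their original pool order because Python's sort is stable.
    """
    if k < 0:
        raise ValueError("k must be non-negative")
    # Rank the whole pool by score, highest first, then truncate to k.
    ranked = sorted(examples, key=score_fn, reverse=True)
    return ranked[:k]


# Usage with a toy pool and a placeholder score (response length).
pool = ["ok", "a longer response", "mid size"]
subset = select_top_k(pool, score_fn=len, k=2)
```

Each example is scored independently here, so the selected subset depends only on per-example rank and not on how the examples combine.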