Instruction Data Selection via Answer Divergence
arXiv:2604.10448v2
Abstract: Instruction tuning relies on large instruction-response corpora whose quality and composition strongly affect downstream performance. We propose Answer Divergence-Guided Selection (ADG), which select…