cs.CL

What Makes Good Instruction-Tuning Data? An In-Context Learning Perspective

arXiv:2604.25132v1 Announce Type: new
Abstract: Instruction-tuning datasets often contain substantial redundancy and low-quality samples, necessitating effective data selection methods. We propose an instruction data selection framework based on weigh…