Alignment Data Map for Efficient Preference Data Selection and Diagnosis
arXiv:2505.23114v3 Announce Type: replace
Abstract: Human preference data is essential for aligning large language models (LLMs) with human values, but collecting such data is often costly and inefficient-motivating the need for efficient data selecti…