cs.CL

CoAct: Co-Active LLM Preference Learning with Human-AI Synergy

arXiv:2604.17501v1 Announce Type: new
Abstract: Learning from preference-based feedback has become an effective approach for aligning LLMs across diverse tasks. However, high-quality human-annotated preference data remains expensive and scarce. Existi…