cs.AI, cs.CL

Conversation for Non-verifiable Learning: Self-Evolving LLMs through Meta-Evaluation

arXiv:2601.21464v2 Announce Type: replace
Abstract: Training large language models (LLMs) for non-verifiable tasks, such as creative writing, dialogue, and ethical reasoning, remains challenging due to the absence of ground-truth labels. While LLM-as-…