Yuan Sui, Bryan Hooi

Conversation for Non-verifiable Learning: Self-Evolving LLMs through Meta-Evaluation

Yuan Sui, Bryan Hooi / May 8, 2026

arXiv:2601.21464v2 Announce Type: replace
Abstract: Training large language models (LLMs) for non-verifiable tasks, such as creative writing, dialogue, and ethical reasoning, remains challenging due to the absence of ground-truth labels. While LLM-as-…
