HoWToBench: Holistic Evaluation for LLM’s Capability in Human-level Writing using Tree of Writing
arXiv:2604.19071v1 Announce Type: new
Abstract: Evaluating the writing capabilities of large language models (LLMs) remains a significant challenge due to the multidimensional nature of writing skills and the limitations of existing metrics. LLM’s per…