cs.CL

From Coarse to Fine: Benchmarking and Reward Modeling for Writing-Centric Generation Tasks

arXiv:2604.27453v1 Announce Type: new
Abstract: Large language models have achieved remarkable progress in text generation but still struggle with generative writing tasks. On the evaluation side, existing benchmarks assess writing reward models coa…