cs.CL

Evaluation Revisited: A Taxonomy of Evaluation Concerns in Natural Language Processing

arXiv:2604.25923v1 Announce Type: new
Abstract: Recent advances in large language models (LLMs) have prompted a growing body of work that questions the methodology of prevailing evaluation practices. However, many such critiques have already been exte…