When the Gold Standard Isn’t Necessarily Standard: Challenges of Evaluating the Translation of User-Generated Content
arXiv:2512.17738v2 Announce Type: replace
Abstract: User-generated content (UGC) is characterised by frequent use of non-standard language, from spelling errors to expressive choices such as slang, character repetitions, and emojis. This makes evaluat…