cs.CL, cs.IR, cs.LG

LLMs as Assessors: Right for the Right Reason?

arXiv:2601.08919v2 Announce Type: replace-cross
Abstract: A good deal of recent research has focused on how Large Language Models (LLMs) may be used as judges in place of humans to evaluate the quality of the output produced by various text/image pr…