LLMbench: A Comparative Close Reading Workbench for Large Language Models
arXiv:2604.15508v1 Announce Type: cross
Abstract: LLMbench is a browser-based workbench for the comparative close reading of large language model (LLM) outputs. Where existing tools for LLM comparison, such as Google PAIR’s LLM Comparator are engineer…