KWBench: Measuring Unprompted Problem Recognition in Knowledge Work
arXiv:2604.15760v1 Announce Type: new
Abstract: We introduce the first version of KWBench (Knowledge Work Bench), a benchmark for unprompted problem recognition in large language models: can an LLM identify a professional scenario before attempting to…