CharBench: Evaluating the Role of Tokenization in Character-Level Tasks
arXiv:2508.02591v3 Announce Type: replace
Abstract: Tasks that require character-level reasoning, such as counting or locating characters within words, remain challenging for contemporary language models. A common conjecture is that language models’ r…