Omri Uzan, Yuval Pinter

CharBench: Evaluating the Role of Tokenization in Character-Level Tasks

Omri Uzan, Yuval Pinter / April 8, 2026

arXiv:2508.02591v3 Announce Type: replace
Abstract: Tasks that require character-level reasoning, such as counting or locating characters within words, remain challenging for contemporary language models. A common conjecture is that language models’ r…

Author name: Omri Uzan, Yuval Pinter

CharBench: Evaluating the Role of Tokenization in Character-Level Tasks