Jonathan Hayase, Alisa Liu, Noah A. Smith, Sewoong Oh

Sampling from Your Language Model One Byte at a Time

Jonathan Hayase, Alisa Liu, Noah A. Smith, Sewoong Oh / May 8, 2026

arXiv:2506.14123v3 Announce Type: replace
Abstract: Tokenization is used almost universally by modern language models, enabling efficient text representation using multi-byte or multi-character tokens. However, prior work has shown that tokenization c…

Author name: Jonathan Hayase, Alisa Liu, Noah A. Smith, Sewoong Oh

Sampling from Your Language Model One Byte at a Time