Ivan Yee Lee, Cheng Yang, Taylor Berg-Kirkpatrick

Optical Context Compression Is Just (Bad) Autoencoding

Ivan Yee Lee, Cheng Yang, Taylor Berg-Kirkpatrick / April 7, 2026

arXiv:2512.03643v2
Abstract: DeepSeek-OCR shows that rendered text can be reconstructed from a small number of vision tokens, sparking excitement about using vision as a compression medium for long textual contexts. But this pip…
