Tongxi Wang - Provide.ai

FBS: Modeling Native Parallel Reading inside a Transformer

Tongxi Wang / April 9, 2026

arXiv:2601.21708v2 Announce Type: replace
Abstract: Large language models (LLMs) excel across many tasks, yet inference is still dominated by strictly token-by-token autoregression. Existing acceleration methods largely patch this pipeline and miss co…

Author name: Tongxi Wang

FBS: Modeling Native Parallel Reading inside a Transformer