cs.LG

A Theory of Online Learning with Autoregressive Chain-of-Thought Reasoning

arXiv:2605.06819v1 Announce Type: new
Abstract: Autoregressive generation lies at the heart of the mechanism of large language models. It can be viewed as the repeated application of a next-token generator: starting from an input string (prompt), the …