cs.LG

Autoregressive Learning in Joint KL: Sharp Oracle Bounds and Lower Bounds

arXiv:2605.12316v1 Announce Type: new
Abstract: We study the fundamental and timely problem of learning long sequences in autoregressive modeling and next-token prediction under model misspecification, measured by the joint Kullback–Leibler (KL) dive…