cs.AI, cs.LG

Design Principles for Sequence Models via Coefficient Dynamics

arXiv:2510.09389v2 Announce Type: replace
Abstract: Deep sequence models, ranging from Transformers and State Space Models (SSMs) to more recent approaches such as gated linear RNNs, fundamentally compute outputs as linear combinations of past value v…