StateX: Enhancing RNN Recall via Post-training State Expansion
arXiv:2509.22630v2 Announce Type: replace-cross
Abstract: Recurrent neural networks (RNNs), such as linear attention and state-space models, have gained popularity due to their constant per-token complexity when processing long contexts. However, thes…