Deep Delta Learning
arXiv:2601.00417v3 Announce Type: replace-cross
Abstract: Transformer residual streams evolve by additive accumulation: each layer appends a feature update to a shared hidden state, but has no direct mechanism for replacing content that has become obs…
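
The additive accumulation described in the abstract can be illustrated with a minimal sketch. This is a toy illustration only, not the paper's method: the function `run_residual_stream` and the layers `write_feature` and `erase_feature` are hypothetical names introduced here, and the hidden state is a plain list of floats rather than a real Transformer activation.

```python
from typing import Callable, List

Vector = List[float]
Layer = Callable[[Vector], Vector]


def run_residual_stream(h0: Vector, layers: List[Layer]) -> Vector:
    """Apply h <- h + f(h) for each layer; updates are only ever added."""
    h = list(h0)  # copy so the caller's initial state is not mutated
    for layer in layers:
        update = layer(h)
        h = [hi + ui for hi, ui in zip(h, update)]
    return h


# Hypothetical toy layers (assumptions for illustration only).
def write_feature(h: Vector) -> Vector:
    # Writes the value 1.0 into slot 0 of the shared hidden state.
    return [1.0, 0.0]


def erase_feature(h: Vector) -> Vector:
    # There is no overwrite operation: to remove slot 0, a later layer
    # has to read the current value and emit its exact negation.
    return [-h[0], 0.0]


after_write = run_residual_stream([0.0, 0.0], [write_feature])
after_erase = run_residual_stream([0.0, 0.0], [write_feature, erase_feature])
```

In this sketch, removing the written feature is only possible because `erase_feature` computes a cancelling update from the current state; the stream itself offers nothing but addition.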