The Geometric Anatomy of Capability Acquisition in Transformers
arXiv:2602.15997v4 Announce Type: replace-cross
Abstract: Neural networks gain capabilities during training, but the internal changes that precede capability acquisition are not well understood. In particular, the relationship between geometric change…