cs.AI, cs.CL, cs.LG

The Geometric Anatomy of Capability Acquisition in Transformers

arXiv:2602.15997v4 Announce Type: replace-cross
Abstract: Neural networks gain capabilities during training, but the internal changes that precede capability acquisition are not well understood. In particular, the relationship between geometric change…