cond-mat.dis-nn, cond-mat.stat-mech, cs.LG

Distinct mechanisms underlying in-context learning in transformers

arXiv:2604.12151v1 Announce Type: new
Abstract: Modern distributed networks, notably transformers, acquire a remarkable ability (termed ‘in-context learning’) to adapt their computation to input statistics, such that a fixed network can be applied to …