Out-of-Distribution Generalization of In-Context Learning: A Low-Dimensional Subspace Perspective
arXiv:2505.14808v2 Announce Type: replace
Abstract: The transformer’s remarkable ability to perform in-context learning (ICL) has sparked a wide range of studies designed to understand its strengths and limitations. However, a theoretical understandin…