cs.LG, math.ST, stat.ML, stat.TH

Out-of-Distribution Generalization of In-Context Learning: A Low-Dimensional Subspace Perspective

arXiv:2505.14808v2 Announce Type: replace
Abstract: The transformer’s remarkable ability to perform in-context learning (ICL) has sparked a wide range of studies designed to understand its strengths and limitations. However, a theoretical understandin…