cs.CL

Task Vectors, Learned Not Extracted: Performance Gains and Mechanistic Insight

arXiv:2509.24169v2 Announce Type: replace
Abstract: Large Language Models (LLMs) can perform new tasks from in-context demonstrations, a phenomenon known as in-context learning (ICL). Recent work suggests that these demonstrations are compressed into …