cs.AI, cs.LG

Continuous Latent Contexts Enable Efficient Online Learning in Transformers

arXiv:2605.09867v1 Announce Type: cross
Abstract: Large language models (LLMs) exhibit a strong capacity for in-context learning: Given labeled examples, they can generate good predictions without parameter updates. However, many interactive settings …