cs.AI, cs.LG

On the Existence of Universal Simulators of Attention

arXiv:2506.18739v2 Announce Type: replace
Abstract: Previous work on the learnability of transformers \textemdash\ focused primarily on examining their ability to approximate specific algorithmic patterns through training \textemdash\ has largely been…