Antonio Lopardo, Avyukth Harish, Catherine Arnett, Akshat Gupta

Weight Tying Biases Token Embeddings Towards the Output Space

Antonio Lopardo, Avyukth Harish, Catherine Arnett, Akshat Gupta / March 30, 2026

arXiv:2603.26663v1 Announce Type: new
Abstract: Weight tying, i.e. sharing parameters between input and output embedding matrices, is common practice in language model design, yet its impact on the learned embedding space remains poorly understood. In…

Author name: Antonio Lopardo, Avyukth Harish, Catherine Arnett, Akshat Gupta

Weight Tying Biases Token Embeddings Towards the Output Space