cs.LG

Preconditioned Attention: Enhancing Efficiency in Transformers

arXiv:2603.27153v1 Announce Type: new
Abstract: Central to the success of Transformers is the attention block, which effectively models global dependencies among the input tokens associated with a dataset. However, we theoretically demonstrate that standard…
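
For context, here is a minimal NumPy sketch of the standard scaled dot-product attention the abstract refers to; it is not the paper's preconditioned variant (the abstract is truncated before those details). The function names and toy shapes are illustrative only.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(Q, K, V):
    """Standard scaled dot-product attention.

    Q, K, V: (n_tokens, d) arrays. Every token attends to every
    other token -- the global dependency modeling the abstract
    mentions, and the source of the O(n^2) cost in sequence length.
    """
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)   # (n, n) pairwise similarities
    return softmax(scores, axis=-1) @ V

# Toy self-attention over 4 tokens with 8-dimensional embeddings.
rng = np.random.default_rng(0)
X = rng.standard_normal((4, 8))
out = attention(X, X, X)
print(out.shape)  # (4, 8)
```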