A Visual Guide to Attention Variants in Modern LLMs

By Sebastian Raschka, PhD / March 22, 2026

From MHA and GQA to MLA, sparse attention, and hybrid architectures