cs.AI, cs.DC

Space Filling Curves is All You Need: Communication-Avoiding Matrix Multiplication Made Simple

arXiv:2601.16294v2 Announce Type: replace-cross
Abstract: General Matrix Multiplication (GEMM) is the cornerstone of HPC workloads and Deep Learning. State-of-the-art vendor libraries tune tensor layouts, parallelization schemes, and cache blocking to…