LATMiX: Learnable Affine Transformations for Microscaling Quantization of LLMs
arXiv:2602.17681v2
Abstract: Post-training quantization (PTQ) is a widely used approach for reducing the memory and compute costs of large language models (LLMs). Recent studies have shown that applying invertible transfor…