AA-SVD : Anchored and Adaptive SVD for Large Language Model Compression
arXiv:2604.02119v1 Announce Type: new
Abstract: We introduce a fast low-rank factorization-based framework for compressing large language models that enables rapid compression of billion-parameter models without retraining. Unlike existing factorizati…