cs.LG

Predicting LLM Compression Degradation from Spectral Statistics

arXiv:2604.18085v1 Announce Type: new
Abstract: Matrix-level low-rank compression is a promising way to reduce the cost of large language models, but running compression and evaluating the resulting models on language tasks can be prohibitively expens…