cs.LG, stat.ML

IMPACT: Importance-Aware Activation Space Reconstruction

arXiv:2507.03828v4 Announce Type: replace
Abstract: Large language models (LLMs) achieve strong performance across diverse domains but remain difficult to deploy in resource-constrained environments due to their size. Low-rank compression is a common …