IO-SVD: Input-Output Whitened SVD for Adaptive-Rank LLM Compression
arXiv:2605.15626v1 Announce Type: new
Abstract: Large language models deliver strong performance across language and reasoning tasks, but their storage and compute costs remain major barriers to deployment in resource-constrained and latency-sensitive…