cs.AI, cs.LG

Preserve-Then-Quantize: Balancing Rank Budgets for Quantization Error Reconstruction in LLMs

arXiv:2602.02001v2 Announce Type: replace-cross
Abstract: Quantization Error Reconstruction (QER) reduces accuracy loss in Post-Training Quantization (PTQ) by approximating weights as $\mathbf{W} \approx \mathbf{Q} + \mathbf{L}\mathbf{R}$, using a ran…