Provable Post-Training Quantization: Theoretical Analysis of OPTQ and Qronos
arXiv:2508.04853v2 Announce Type: replace
Abstract: Post-training quantization (PTQ) has become a crucial tool for reducing the memory and compute costs of modern deep neural networks, including large language models (LLMs). Among PTQ algorithms, the …