cs.AI, cs.IT, cs.LG, cs.NA, math.IT, math.NA

Provable Post-Training Quantization: Theoretical Analysis of OPTQ and Qronos

arXiv:2508.04853v2 Announce Type: replace
Abstract: Post-training quantization (PTQ) has become a crucial tool for reducing the memory and compute costs of modern deep neural networks, including large language models (LLMs). Among PTQ algorithms, the …