PoTAcc: A Pipeline for End-to-End Acceleration of Power-of-Two Quantized DNNs
arXiv:2605.06082v1 Announce Type: cross
Abstract: Power-of-two (PoT) quantization significantly reduces the size of deep neural networks (DNNs) and replaces multiplications with bit-shift operations for inference. Prior work has shown that PoT-quantiz…
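To illustrate the general idea the abstract refers to, the following is a minimal sketch of how a power-of-two weight lets an integer multiplication be replaced by a bit shift. It is not PoTAcc's implementation: the function names `pot_quantize` and `shift_multiply`, the exponent range, and the log-domain rounding rule are all assumptions chosen for illustration.

```python
import math


def pot_quantize(w, min_exp=-7, max_exp=0):
    """Round a real weight to a signed power of two, sign * 2**exp.

    Hypothetical helper: rounding is done in the log2 domain and the
    exponent is clamped to [min_exp, max_exp]. Returns (sign, exp);
    a zero weight is encoded as sign == 0.
    """
    if w == 0:
        return 0, 0
    sign = 1 if w > 0 else -1
    exp = round(math.log2(abs(w)))
    exp = max(min_exp, min(max_exp, exp))
    return sign, exp


def shift_multiply(x, sign, exp):
    """Compute x * (sign * 2**exp) for an integer activation x using shifts only.

    Negative exponents use a right shift, which discards low-order bits
    (floor division by a power of two).
    """
    if sign == 0:
        return 0
    shifted = x << exp if exp >= 0 else x >> -exp
    return sign * shifted


# Example: a two-element dot product with shift-based "multiplications".
weights = [0.23, -0.5]      # quantize to +2**-2 and -2**-1
activations = [40, 12]      # integer (already quantized) activations
quantized = [pot_quantize(w) for w in weights]
dot = sum(shift_multiply(x, s, e) for x, (s, e) in zip(activations, quantized))
```

Here `0.23` is approximated by `2**-2`, so multiplying the activation `40` by it reduces to `40 >> 2`. The trade-off in this sketch is that only the sign and a small exponent need to be stored per weight, at the cost of the rounding error introduced by snapping each weight to a power of two.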