cs.AR, cs.LG, cs.PF

PoTAcc: A Pipeline for End-to-End Acceleration of Power-of-Two Quantized DNNs

arXiv:2605.06082v1 Announce Type: cross
Abstract: Power-of-two (PoT) quantization significantly reduces the size of deep neural networks (DNNs) and replaces multiplications with bit-shift operations for inference. Prior work has shown that PoT-quantiz…