PQuantML: A Tool for End-to-End Hardware-aware Model Compression
arXiv:2603.26595v1
Abstract: PQuantML is a new open-source, hardware-aware neural network model compression library tailored to end-to-end workflows. Motivated by the need to deploy performant models to environments with strict late…