cs.LG

Big2Small: A Unifying Neural Network Framework for Model Compression

arXiv:2603.29768v1 Announce Type: new
Abstract: With the development of foundational models, model compression has become a critical requirement. Various model compression approaches have been proposed such as low-rank decomposition, pruning, quantiza…