cs.LG

MOONSHOT: A Framework for Multi-Objective Pruning of Vision and Large Language Models

arXiv:2604.13287v1 Announce Type: new
Abstract: Weight pruning is a common technique for compressing large neural networks. We focus on the challenging post-training one-shot setting, where a pre-trained model is compressed without any retraining. Exi…