CORP: Closed-Form One-shot Representation-Preserving Structured Pruning for Transformers
arXiv:2602.05243v2 Announce Type: replace
Abstract: Transformers achieve strong accuracy but incur high compute and memory cost. Structured pruning reduces inference cost, but most methods rely on retraining or multi-stage optimization, which limits p…