AutoCompress: Critical Layer Isolation for Efficient Transformer Compression
arXiv:2604.22786v1 Announce Type: new
Abstract: We present AutoCompress, a transformer compression method motivated by an empirical finding: in small transformers, Layer 0 carries disproportionately high task-critical information, with an NTK-based im…