DiBA: Diagonal and Binary Matrix Approximation for Neural Network Weight Compression
arXiv:2605.05994v1 Announce Type: new
Abstract: In this paper, we propose DiBA (Diagonal and Binary Matrix Approximation), a compact matrix factorization for neural network weight compression. Many components of modern networks, including linear layer…