cs.DS, cs.LG, cs.PF

RSR-core: A High-Performance Engine for Low-Bit Matrix-Vector Multiplication

arXiv:2603.27462v1 Announce Type: cross
Abstract: Matrix-vector multiplication is a fundamental building block in neural networks, vector databases, and large language models, particularly during inference. As a result, efficient matrix-vector multipl…