Andrew Kiruluta - Provide.ai

Compressed-Sensing-Guided, Inference-Aware Structured Reduction for Large Language Models

Andrew Kiruluta / April 17, 2026

arXiv:2604.14156v1 Announce Type: new
Abstract: Large language models deliver strong generative performance but at the cost of massive parameter counts, memory use, and decoding latency. Prior work has shown that pruning and structured sparsity can pr…

Author name: Andrew Kiruluta

Compressed-Sensing-Guided, Inference-Aware Structured Reduction for Large Language Models