Rachmad Vidya Wicaksana Putra, Pasindu Wickramasinghe, Muhammad Shafique

QSLM: A Performance- and Memory-aware Quantization Framework with Tiered Search Strategy for Spike-driven Language Models

Rachmad Vidya Wicaksana Putra, Pasindu Wickramasinghe, Muhammad Shafique / April 22, 2026

arXiv:2601.00679v2 Announce Type: replace-cross
Abstract: Large Language Models (LLMs) have been emerging as prominent AI models for solving many natural language tasks due to their high performance (e.g., accuracy) and capabilities in generating high…

Author name: Rachmad Vidya Wicaksana Putra, Pasindu Wickramasinghe, Muhammad Shafique

QSLM: A Performance- and Memory-aware Quantization Framework with Tiered Search Strategy for Spike-driven Language Models