cs.LG

LAQuant: A Simple Overhead-free Large Reasoning Model Quantization by Layer-wise Lookahead Loss

arXiv:2605.08755v1 Announce Type: new
Abstract: Large reasoning models (LRMs) reach competition-level math and coding accuracy via long autoregressive decoding, making per-token decoding cost a primary deployment concern. Weight quantization is the st…
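The abstract is truncated, but the title names the core idea: calibrating each quantized layer against a loss that looks ahead to the next layer's output, rather than only the current layer's reconstruction error. As a rough illustration only (not the paper's actual method), here is a minimal NumPy sketch contrasting a plain layer-wise reconstruction loss with a one-layer lookahead loss; the round-to-nearest quantizer, the `tanh` activation, and all function names are illustrative assumptions:

```python
import numpy as np

def quantize_weights(W, n_bits=4):
    """Symmetric per-output-channel round-to-nearest quantization (an assumed baseline)."""
    scale = np.abs(W).max(axis=1, keepdims=True) / (2 ** (n_bits - 1) - 1)
    scale = np.where(scale == 0, 1.0, scale)  # avoid division by zero for all-zero rows
    return np.round(W / scale) * scale

def layerwise_loss(W, Wq, X):
    """Standard layer-wise objective: reconstruction error of this layer's output."""
    return np.mean((W @ X - Wq @ X) ** 2)

def lookahead_loss(W, Wq, W_next, X, act=np.tanh):
    """Lookahead objective: error measured after propagating through the NEXT layer,
    so the quantization of layer i is judged by its downstream effect."""
    return np.mean((W_next @ act(W @ X) - W_next @ act(Wq @ X)) ** 2)
```

Under this sketch, a quantization configuration (e.g. bit width or rounding choice) for layer `i` would be selected to minimize `lookahead_loss` on a small calibration batch `X`, instead of `layerwise_loss`.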