/u/enrique-byteshape

Qwen 3.6 35B GGUF: NTP vs MTP quantization results across GPUs and CPUs

/u/enrique-byteshape / May 20, 2026

Hey r/LocalLLaMA, We’ve released our ByteShape Qwen 3.6 35B GGUF quantizations in two families: standard NTP (Next Token Prediction or non-MTP) and MTP. Blog / Download NTP Models / Download MTP Models TL;DR For NTP, “pick the largest quant that…

Author name: /u/enrique-byteshape

Qwen 3.6 35B GGUF: NTP vs MTP quantization results across GPUs and CPUs