Author name: /u/admajic

LocalLLaMA

Get faster Qwen 3.6 27B

Using 100k context on a 3090 with the MTP GGUF, I'm getting 50 t/s on llama.cpp. Thought I would knowledge share.

Use https://huggingface.co/RDson/Qwen3.6-27B-MTP-Q4_K_M-GGUF and am17an's commit:

/media/adam/D_DRIVE/LLM/llama-cpp-am17an/build/bin/llama-server -…
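Since the original command is truncated, here is a minimal sketch of a `llama-server` invocation matching the setup described (100k context, full GPU offload on a 3090). The model filename and paths are assumptions; only standard llama.cpp flags are used, and any MTP-specific options would come from am17an's branch, not shown here.

```shell
# Sketch only: paths and model filename are placeholders, not from the post.
# -m    : path to the downloaded GGUF
# -c    : context size (post reports 100k)
# -ngl  : offload all layers to the GPU (3090)
./build/bin/llama-server \
  -m ./models/Qwen3.6-27B-MTP-Q4_K_M.gguf \
  -c 100000 \
  -ngl 99 \
  --port 8080
```

The server then exposes an OpenAI-compatible API on the given port; point your client at `http://localhost:8080`.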
