Author name: /u/_BigBackClock

QWEN3.6 + ik_llama is fast af

Running Qwen3.6 UD-Q4_K_M on 16 GB VRAM + 32 GB RAM with a 200k context window at 50+ tok/s.

Using the Unsloth dynamic quant on 16 GB VRAM + 32 GB DRAM, with a 200k-token context window and a q8_0 KV cache.
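
For anyone wondering why the q8_0 KV cache matters at 200k context, here is a rough back-of-the-envelope sketch in Python. The layer count, KV head count, and head dimension below are placeholder assumptions, not confirmed Qwen3.6 values, and it assumes every layer keeps a full K and V cache, so plug in the numbers from your own model's metadata. The only fixed fact is the q8_0 block layout: 32 int8 values plus one fp16 scale, i.e. 34 bytes per 32 elements.

```python
# Rough KV cache size estimate. Architecture numbers are placeholders
# (assumptions for illustration), not confirmed values for any specific model.

F16_BYTES_PER_ELEM = 2.0
Q8_0_BYTES_PER_ELEM = 34 / 32  # 32 int8 values + one fp16 scale per block


def kv_cache_bytes(n_ctx, n_layers, n_kv_heads, head_dim, bytes_per_elem):
    """Bytes needed to hold both K and V for n_ctx tokens.

    Assumes every layer stores a full K and V tensor per token.
    """
    elems_per_token_per_layer = n_kv_heads * head_dim
    return 2 * n_ctx * n_layers * elems_per_token_per_layer * bytes_per_elem


def to_gib(n_bytes):
    return n_bytes / (1024 ** 3)


if __name__ == "__main__":
    # Hypothetical architecture values; replace with your model's real ones.
    n_ctx = 200_000      # "200k" taken as 200,000 tokens (assumption)
    n_layers = 48        # placeholder
    n_kv_heads = 4       # placeholder
    head_dim = 128       # placeholder

    for name, size in (("f16", F16_BYTES_PER_ELEM), ("q8_0", Q8_0_BYTES_PER_ELEM)):
        total = kv_cache_bytes(n_ctx, n_layers, n_kv_heads, head_dim, size)
        print(f"{name}: {to_gib(total):.2f} GiB")
```

Whatever the real architecture numbers turn out to be, the ratio holds: q8_0 takes about 53% of the space of an f16 cache, which is a big part of what makes a 200k window feasible alongside the model weights on 16 GB VRAM + 32 GB RAM.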